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(57) Abstract: Described herein are methods that can be used for diagnosis of angiogenesis and angiogenic phenotypes. Also de- 
J scribed herein are methods that can be used to screen candidate Woactive agents for the ability to modulate angiogenesis. Additionally, 
► methods and molecular targets (genes and their products) for therapeutic intervention in disorders associated with angiogenesis are 

described. 
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NOVEL METHODS OF DIAGNOSIS OF ANGIOGENESIS, COMPOSITIONS AND METHODS 
OF SCREENING FOR ANGIOGENESIS MODULATORS 

FIELD OF THE INVENTION 

The invention relates to the identification of expression profiles and the nucleic acids Involved in 
5 angiogenesis, and to the use of such expression profiles and nucleic acids In diagnosis of 

angiogenesis. The invention further relates to methods for identifying candidate agents and/or targets 
which modulate angiogenesis. 

BACKGROUND OF THE INVENTION 

New blood vessel development comprises the formation of veins (vasculogenesis) and arteries 
10 (angiogenesis). Angiogenesis plays a normal role in embryonic development, as well as menstration, 
wound healing. Angiogenesis also plays a crucial pathogenic role in a variety of disease states, 
including cancer, proliferative diabetic retinopathy, and maintaining blood flow to chronic Inflammatory 
sites. 

Angiogenesis has a number of stages. The early stages of angiogenesis include endothelial cell 
1 5 protease production, migration of cells and proliferation. The early stages also appear to require some 
growth factors, with VEGF, TGF-a, angiostatin, and selected chemokines all putatively playing a role. 

Later stages of angiogenesis include the population of the vessels with mural cells (pericytes or 
smooth muscle cells), basement membrane production and the induction of vessel bed 
specializations. The final stages of vessel formation include what is known as "remodeling", wherein a 
2 0 fonning vasculature becomes a stable, mature vessel bed. 

Thus, understanding the genes, proteins and regulatory mechanisms that occur during angiogenesis 
would be desirable. Accordingly, it is an object of the invention to provide methods that can be used to 
screen candidate bioactive agents for the ability to modulate angiogenesis. Additionally, It Is an object 
to provide molecular targets for therapeutic intervention in disease states which either have an 
2 5 undesirable excess or a deficit in angiogenesis. 



1 



wo 01/11086 



PCT/USOO/22061 



SUMMARY OF THE INVENTION 

The present invention provides novel methods for diagnosis and prognosis evaluation for 
angiogenesis, as w/ell as methods for screening for compositions which modulate angiogenesis. 
Methods of treatment of disorders associated with angiogenesis, as well as compositions are also 
5 provided herein. 

In one aspect, a method of screening drug candidates comprises providing a cell that expresses an 
expression profile gene or fragments thereof, or fragments thereof. Preferred embodiments of the 
expression profile gene are genes which are differentially expressed in angiogenesis cells, compared 
to other cells. Preferred embodiments of expression profile genes used in the methods herein include 

10 but are not limited to the group consisting of AAA4, AAA1, Edg-1, alpha 5 betal integrin, endomucln 
and matrix metalloproteinase 10; fragments of the proteins of this group are also preferred. It is 
understood that molecules for use in the present invention may be from any figure or any subset of 
listed molecules. Therefore, for example, any one or more of the genes listed above can be used In 
the methods herein. In another embodiment, a nucleic acid is selected from Tables 1 , 2, 3, 4 or 5. 

15 Prefen-ed nucleic acids are in Table 4, and most preferably Table 5. The method further includes 

adding a drug candidate to the cell and determining the effect of the drug candidate on the expression 
of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes comparing the level of 
expression in the absence of the drug candidate to the level of expression in the presence of the drug 

2 0 candidate, wherein the concentration of the drug candidate can vary when present, and wherein the 

comparison can occur after addition or removal of the drug candidate. In a preferred embodiment, the 
cell expresses at least two expression profile genes. The profile genes may show an increase or 
decrease. 

Also provided herein is a method of screening for a bioactive agent capable of binding to an 
25 angiogenesis modulator protein (AMP), the method comprising combining the AMP and a candidate 
bioactive agent, and determining the binding of the candidate agent to the AMP. Preferably the AMP 
is a protein or fragment thereof selected from the group consisting of AAA4, AAA1 , Edg-1 , alpha 5 
betal integrin, endomucln and matrix metalloproteinase 10. In another embodiment, the proteins is 
encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred nucleic acids are in Table 4, 

3 0 and most preferably Table 5. 
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Further provided herein is a method for screening for a bioactive agent capable of modulating the 
activity of an AiVIP. In one embodiment the method comprises combining the AMP and a candidate 
bioactive agent, and determining the effect of the candidate agent on the bioactivity of the AMP. 
Preferably the AMP is a protein or fragment thereof selected from the group consisting of AAA4, 
5 AAA1, Edg-1 , alpha 5 beta1 integrin, endomucin and matrix metalloproteinase 10. In another 

embodiment, the proteins is encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred 
nucleic acids are in Table 4, and most preferably Table 5. 

Also provided is a method of evaluating the effect of a candidate angiogenesis drug comprising 
administering the drug to a transgenic animal expressing or over-expressing the AMP, or an animal 

1 0 lacking the AMP, for example as a result of a gene knockout. 

Additionally, provided herein is a method of evaluating the effect of a candidate angiogenesis drug 
comprising administering the drug to a patient and removing a cell sample from the patient. The 
expression profile of the cell is then determined. This method may further comprise comparing the 
expression profile to an expression profile of a healthy individual. In a preferred embodiment, the 
1 5 expression profile includes a gene of Table 1 , Table 2, Table 3, Table 4 or Table 5. 

Moreover, provided herein is a biochip comprising one or more nucleic acid segments which encode 
an angiogenesis protein, preferable selected from the group consisting of AAA4, AAA1, Edg-1, alpha 5 
betal integrin, endomucin and matrix metalloproteinase , or fragment thereof, wherein the biochip 
comprises fewer than 1000 nucleic acid probes. Preferably at least two nucleic acid segments are 

2 0 included. In another embodiment, the nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred 

nucleic acids are in Table 4, and most preferably Table 5. 

Furthermore, a method of diagnosing a disorder associated with angiogenesis is provided. The 
method comprises determining the expression of a gene which encodes an angiogenesis protein 
preferable selected from the group consisting of AAA4, AAA1 , Edg-1 , alpha 5 betal integrin, 
25 endomucin and matrix metalloproteinase 10, or fragment thereof in a first tissue type of a first 

individual, and comparing the distribution to the expression of the gene from a second normal tissue 
type from the first individual or a second unaffected individual. In another embodiment, the proteins is 
encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred nucleic acids are in Table 4, 
and most preferably Table 5. A difference In the expression indicates that the first individual has a 

3 0 disorder associated with angiogenesis. 
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In another aspect, the present invention provides an antibody which specifically binds to an 
angiogenesis preferably selected from the group consisting of AAA4, AAA1, Edg-1, alpha 5 beta1 
integrln, endomucin and matrix metalloproteinase 1 0 or fragment thereof. . In another embodiment, 
the proteins is encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Prefen-ed nucleic acids 
5 are in Table 4, and most preferably Table 5. In a preferred embodiment the fragment of AAA1 is 
selected from AAA1p1 or AAA1p2. Other preferred fragments for the angiogenesis proteins are 
shown in the figures. 

In one embodiment a method for screening for a bioactive agent capable of interfering with the binding 
of a angiogenesis modulating protein (AMP) or a fragment thereof and an antibody which binds to said 
1 0 AMP or fragment thereof. In a preferred embodiment, the method comprises combining an AMP or 
fragment thereof, a candidate bioactive agent and an antibody which binds to said AMP or fragment 
thereof. The method further includes determining the binding of said AMP or fragment thereof and 
said antibody. Wherein there is a change in binding, an agent is identified as an interfering agent. 
The interfering agent can be an agonist or an antagonist. Preferably, the agent inhibits angiogenesis. 

15 In a further aspect, a method for inhibiting angiogenesis Is provided. In one embodiment, the method 
comprises administering to a cell a composition comprising an antibody to an angiogenesis 
modulating protein, preferably selected from the group consisting of AAA4, AAA1, Edg-1, alpha 5 
beta1 integrin, endomucin and matrix metalloproteinase 10, or fragment thereof. In another 
embodiment, the proteins is encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Prefen-ed 

2 0 nucleic acids are in Table 4, and most preferably Table 5. The method can be performed in vitro or in 

vivo, preferably in vivo to an individual. In a preferred embodiment the method of inhibiting 
angiogenesis is provided to an individual with a disorder associated with angiogenesis such as cancer. 
As described herein, methods of inhibiting angiogenesis can be performed by administering an 
inhibitor of the activity of an angiogenesis protein, including an antisense molecule to the gene or its 
25 gene products, and preferable small molecules. 

Also provided herein are methods of eliciting an immune response in an individual. In one 
embodiment a method provided herein comprises administering to an individual a composition 
comprising an angiogenesis modulating protein, preferably selected from the group consisting of 
AAA4, AAA1, Edg-1, alpha 5 betal integrin, endomucin and matrix metalloproteinase 10, or fragment 

3 0 thereof. In another embodiment, the proteins is encoded by a nucleic acid selected from Tables 1 , 2, 

3, 4 or 5. Prefen-ed nucleic acids are in Table 4, and most preferably Table 5. In another aspect, said 
composition comprises a nucleic acid comprising a sequence encoding an angiogenesis modulating 
protein, preferably selected from the group consisting of AAA4, AAA1 , Edg-1 , alpha 5 betal integrin, 



4 



wo 01/11086 



PCT/USOO/22061 



endomucin and matrix metalloproteinase 10, or fragment thereof. In another embodiment, the 
proteins is encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred nucleic adds are 
in Table 4, and most preferably Table 5. 

Further provided herein are compositions capable of eliciting an immune response in an individual. In 
5 one embodiment, a composition provided herein comprises an angiogenesis modulating protein, 
preferably selected from the group consisting of AAA4, AAA1 , Edg-1 , alpha 5 betal integrin, 
endomucin and matrix metalloproteinase 10, or fragment thereof. In another embodiment, the 
proteins Is encoded by a nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred nucleic acids are 
in Table 4, and most preferably Table 5. In another embodiment, said composition comprises a 
10 nucleic acid comprising a sequence encoding an angiogenesis modulating protein, preferably selected 
from the group consisting of AAA4, AAA1, Edg-1, alpha 5 betal integrin, endomucin and matrix 
metalloproteinase 10, or fragment thereof, and a pharmaceutically acceptable can-ier. 

In another embodiment the nucleic acid selected from Tables 1 , 2, 3, 4 or 5. Preferred nucleic acids 
are in Table 4, and most preferably Table 5. 

15 A method of neutralizing the effect of an angiogenesis protein, preferably selected from the group 

consisting of AAA4, AAA1, Edg-1, alpha 5 betal integrin, endomucin and matrix metalloproteinase 10, 
or fragment thereof, comprising contacting an agent specific for said protein with said protein in an 
amount sufficient to effect neutralization. In another embodiment, the proteins is encoded by a nucleic 
acid selected from Tables 1, 2, 3, 4 or 5. Preferred nucleic acids are in Table 4, and most preferably 

20 Tables. 

In another aspect of the invention, a method of treating an individual for a disorder associated with 
angiogenesis is provided. In one embodiment, the method comprises administering to said individual 
an inhibitor of Edg-1 . In another embodiment, the method comprises administering to a patient having 
a disorder with angiogenesis an antibody to Edg-1 conjugated to a therapeutic moiety. Such a 
2 5 therapeutic moiety can be a cytotoxic agent or a radioisotope. 

Novel sequences are provided herein. Compounds and compositions are also provided. Other 
aspects of the invention will become apparent to the skilled artisan by the following description of the 
invention. 

DETAILED DESCRIPTION OF THE TABLES AND FIGURES 
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Table 1 provides the Accession numbers for 1 774 genes, including expression sequence tags, 
(incorporated in their entirety here and throughout the application where Accession numbers are 
provided), whose expression levels change as a function of time in tissue undergoing angiogenesis 
compared to tissue that is not. 

Table 2 provides the Accession numbers for a prefen-ed subset of 559 genes, including expression 
sequence tags (incorporated in their entirety here and throughout the application where Accession 
numbers are provided), whose expression levels change as a function of time in tissue undergoing 
angiogenesis compared to tissue that is not. The sequences are characterized as predicted to 
encode secreted proteins (SS), or transmembrane proteins (TM) proteins. 

Table 3 provides the Accession numbers for 1916 genes including expression sequence tags 
(incorporated In their entirety here and throughout the application where Accession numbers are 
provided), whose expression levels change as a function of time in tissue undergoing angiogenesis 
compared to tissue that is not. 

Table 4 provides a preferred subset of 558 Accession numbers identified in Figure 4 whose 
expression levels change as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 

Table 5 provides a preferred subset of 20 Accession numbers identified in Figure 4 whose expression 
levels change as a function of time in tissue undergoing angiogenesis compared to tissue that is not. 

Figure 1 is a graph of expression levels of sequences identified in Figure 1 . Expression profiles are 
clustered into 4 groups. CI (blue), C2 (red), C3 (green) and C4 (mustard). 

Figure 2 shows an embodiment of a nucleic acid (mRNA) which includes a sequence encoding an 
angiogenesis protein, AAA4. The start and stop codons are underlined. 

Figure 3 shows the open reading frame of a nucleic acid sequence encoding AAA4. The start and 
stop codons are underlined. 

Figure 4 shows an embodiment of the amino acid sequence of AAA4. The signal peptide is double 
underlined, and the transmembrane sequence is underlined. In one embodiment herein, AAA4 is 
soluble. Thus, the signal peptide can be omitted, and the transmembrane domain deleted, 
inactivated, or truncated. 
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Figure 5 shows peptides AAA4p1 and AAA4p2. 

Figure 6 shows the expression of AAA4 in angiogenesis models over time and in other, non- 
angiogenic tissues. 

Figure 7 shows an embodiment of a nucleic acid sequence encoding an angiogenesis protein, AAA1 . 
5 A putative stop codon is underlined. 

Figure 8 shows an embodiment of an amino acid sequence for AAA1 A transmembrane domain is 
underlined. In one embodiment, AAA1 is soluble. In preferred embodiments, the transmembrane 
domain is deleted or inactivated, or AAA1 is tmncated to delete the transmembrane domain. 

10 Figure 9 shows AAA1 pi and AAA1 p2. 

Figure 1 0 shows a graph showing the relative expression of AAA1 in various tissues at different time 
points. "Exp 3" is an angiogenesis model showing tube formation over time using endothelial cells. 

Figure 1 1 shows an embodiment of a nucleic acid, mRNA, which comprises a sequence encoding an 
angiogenesis protein, Edg-1. The start and stop codons are underlined. 

15 Figure 1 2 shows the open reading frame encoding Edg-1 , wherein the start and stop codons are 
underlined. 

Figure 13 shows an embodiment of an amino acid sequence for an angiogenesis pnDtein, Edg-1 , 
wherein the transmembrane domains are underlined. In a prefen^d embodiment herein, a soluble 
2 0 form of Edg-1 is provided. In one embodiment, the transmembrane domains are deleted, inactivated, 
and/or the protein is truncated so as to exclude the domains (with or without re-ligation of remaining 
soluble regions). 

Figure 14 depicts four peptide sequences provided herein and their respective solubilities. 

Figure 15 shows the expression of Edg-1 over a variety of tissues. 

2 5 Figure 1 6 shows the time course of induction of Edg-1 in a model for angiogenesis (Expt 1 , Expt 2, 
Expt 3) in which low passage human endothelial cells form into tube structures over a period of a few 
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days in culture. The reproducible induction of Edg-1 occun-ed in a time frame consistent witti its role in 
the tube forming process. 

Figure 1 7 shows an embodiment of a nucleic acid sequence which includes the coding sequence for a 
tissue remodeling protein, alpha 5 beta 1 integrin (sometimes referred to as VLA-5), wherein the start 
and stop codon are underlined. 

Figure 18 shows an embodiment of an amino acid sequence of a tissue remodeling protein, alpha 5 

beta 1 integrin, wherein a transmembrane domain is underlined. 

Figure 1 9 shows a bar graph depicting the results of 5 expression profiles of alplia 5 beta 1 integrin 
throughout the time course of tube formation. In particular, tube models 1 , 2 and 3 show models 
which form tube structures from single isolated human endothelial cells; the "EC/PMA" model shows 
endothelial cells stimulated with pokeweed mitogen antigen, and the body atlas profile shows 
expression in various normal cell types and tissues. 

Figures 20A and 20B show the results of antagonism of tube formation wherein Figure 20A is an 

isotype control and Figure 20B shows specific antibody antagonism after 48 tiours. 

Figure 21 shows an embodiment of a nucleic acid sequence which includes the coding sequence for 
an angiogenesis protein, endomucin, wherein the start and stop codon are boxed. 

Figure 22 shows an embodiment of an amino acid sequence of an angiogenesis protein, endomucin, 
wherein a signal sequence is bolded and a transmembrane domain is underlined. 

Figure 23 shows an embodiment of a nucleic acid sequence which includes the coding sequence for 
an angiogenesis protein, matrix metalloproteinase 10 (also called stromolysin 2), wherein the start and 
stop codon are boxed. 

Figure 24 shows expression of matrix metalloproteinase 10 over a variety of tissues. 
Figure 25 shows expression of matrix metalloproteinase 10 over a variety of tissues. 
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In accordance with the objects outlined above, the present invention provides novel methods for 
diagnosis of disorders associated wVn angiogenesis (sometimes referred to herein as angiogenesis 
disorders or AD), as well as methods for screening for compositions which modulate angiogenesis. 
By "disorder associated with angiogenesis' or "disease associated with angiogenesis" herein Is meant 
5 a disease state which is marked by either an excess or a deficit of vessel development. Angiogenesis 
disorders include, but are not limited to, cancer and proliferative diabetic retinopathy. Also provided 
are method for treating AD. 

In one aspect, the expression levels of genes are determined in different patient samples for which 
diagnosis information is desired, to provide expression profiles. An expression profile of a particular 

1 0 sample is essentially a "fingerprint" of the state of the sample; while two states may have any 

particular gene similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is unique to the state of the cell. That Is, normal tissue 
may be distinguished from AD tissue. By comparing expression profiles of tissue In known different 
angiogenesis states. Information regarding which genes are Important (Including both up- and down- 

15 regulation of genes) in each of these states Is obtained. The identification of sequences that are 

differentially expressed in angiogenic versus non-angiogenic tissue allows the use of this information 
in a number of ways. For example, the evaluation of a particular treatment regime may be evaluated: 
does a chemotherapeutic drug act to down-regulate angiogenesis and thus tumor growth or 
recurrence in a particular patient. Similarly, diagnosis may be done or confirmed by comparing patient 

2 0 samples with the known expression profiles. Furthermore, these gene expression profiles (or 

individual genes) allow screening of drug candidates with an eye to mimicking or altering a particular 
expression profile; for example, screening can be done for drugs that suppress the angiogenic 
expression profile. This may be done by making biochips comprising sets of the Important 
angiogenesis genes, which can then be used in these screens. These methods can also be done on 
25 the protein basis; that is, protein expression levels of the angiogenic proteins can be evaluated for 
diagnostic purposes or to screen candidate agents. In addition, the angiogenic nucleic acid 
sequences can be administered for gene therapy purposes, including the administration of antisense 
nucleic acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic drugs. 

3 0 Thus the present invention provides nucleic acid and protein sequences that are differentially 

expressed in angiogenesis, herein termed "angiogenesis sequences". As outlined below, 

angiogenesis sequences include those that are up-regulated (i.e. expressed at a higher level) In 
disorders associated with angiogenesis, as well as those that are down-regulated (i.e. expressed at a 
lower level). In a preferred embodiment, the angiogenesis sequences are from humans; however, as 
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will be appreciated by those in the art, angiogenesis sequences from other organisms may be useful 
in animal models of disease and drug evaluation; thus, other angiogenesis sequences are provided, 
from vertebrates, including mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), 
primates, farm animals (including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences 
5 from other organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid sequences. In a preferred 
embodiment, the angiogenesis sequences are recombinant nucleic acids. By the term "recombinant 
nucleic acid" herein is meant nucleic acid, originally formed in vitro, in general, by the manipulation of 
nucleic acid by polymerases and endonucleases, In a form not normally found in nature. Thus an 

1 0 isolated nucleic acid, in a linear form, or an expression vector formed in vitro by ligating DNA 

molecules that are not normally joined, are both considered recombinant for the purposes of this 
invention. It is understood that once a recombinant nucleic acid is made and reintroduced into a host 
cell or organism, it will replicate non-recombinantly, i.e. using the in vivo cellular machinery of the host 
cell rather than in vitro manipulations; however, such nucleic acids, once produced recombinantly, 

15 although subsequently replicated non-recombinantly, are still considered recombinant for the purposes 
of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, i.e. through the 
expression of a recombinant nucleic acid as depicted above. A recombinant protein is distinguished 
from naturally occurring protein by at least one or more characteristics. For example, the protein may 
2 0 be isolated or purified away from some or all of the proteins and compounds with which it is normally 
associated in its wild type host, and thus may be substantially pure. For example, an isolated protein 
is unaccompanied by at least some of the material with which it is normally associated in its natural 
state, preferably constituting at least about 0.5%, more preferably at least about 5% by weight of the 
total protein in a given sample. A substantially pure protein comprises at least about 75% by weight of 

2 5 the total protein, with at least about 80% being preferred, and at least about 90% being particularly 

preferred. The definition includes the production of an angiogenesis protein from one organism in a 
different organism or host cell. Alternatively, the protein may be made at a significantly higher 
concentration than is normally seen, through the use of an inducible promoter or high expression 
promoter, such that the protein is made at increased concentration levels. Altematively, the protein 

3 0 may be in a form not normally found in nature, as in the addition of an epitope tag or amino acid 

substitutions, insertions and deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As will be appreciated by 
those in the art and is more fully outlined below, angiogenesis sequences are useful in a variety of 
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applications, including diagnostic applications, which will detect naturally occurring nucleic acids, as 
well as screening applications; for example, biochips comprising nucleic acid probes to the 
angiogenesis sequences can be generated. In the broadest sense, then, by "nucleic acid" or 
"oligonucleotide" or grammatical equivalents herein means at least two nucleotides covalently linked 
5 together. A nucleic acid of the present invention will generally contain phosphodiester bonds, although 
in some cases, as outlined below, nucleic acid analogs are included that may have alternate 
backbones, comprising, for example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 

(1993) and references therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. 
Biochem. 81:579 (1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai etal, Chem. Lett. 

10 805 (1984), Letsinger etal., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels etal., Chemica Scripta 
26:141 91986)), phosphorothioate (Mag et al.. Nucleic Acids Res. 19:1437 (1991); and U.S. Patent 
No. 5,644,048), phosphorodithioate (Briu etal., J. Am. Chem. Soc. 111:2321 (1989), O- 
methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press), and peptide nucleic acid backbones and linkages (see Egholm, J. 

15 Am. Chem. Soc. 114:1895 (1992); Meier etal.. Chem. Int. Ed. Engl. 31:1008 (1992); Nielsen, Nature, 
365:566 (1993); Carlsson et al., Nature 380:207 (1996), all of which are Incorporated by reference). 
Other analog nucleic acids include those with positive backbones (Denpcy et al., Proc. Natl. Acad. Sci. 
USA 92:6097 (1995): non-ionic backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 
5,216,141 and 4,469,863; Kiedrowshi etal., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger 

2 0 etal., J. Am. Chem. Soc. 110:4470 (1988); Letsinger et al.. Nucleoside & Nucleotide 13:1597 (1994); 

Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", 
Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal Chem. Lett. 4:395 

(1994) : Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 37:743 (1996)) and non- 
rlbose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and 

2 5 Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", 

Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or more carbocyclic sugars are also 
included within one definition of nucleic acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp169- 
176). Several nucleic acid analogs are described in Rawls, C & E News June 2, 1997 page 35. All of 
these references are hereby expressly incorporated by reference. These modifications of the ribose- 

3 0 phosphate backbone may be done for a variety of reasons, for example to increase the stability and 

half-life of such molecules in physiological environments or as probes on a biochip. 

As will be appreciated by those in the art, all of these nucleic acid analogs may find use in the present 
invention. In addition, mixtures of naturally occun-ing nucleic acids and analogs can be made; 
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic 
3 5 acids and analogs may be made. 
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Particularly preferred are peptide nucleic acids (PNA) which includes peptide nucleic acid analogs. 
These backbones are substantially non-ionic under neutral conditions, in contrast to the highly 
charged phosphodiester backbone of naturally occurring nucleic acids. This results in two 
advantages. First, the PNA backbone exhibits improved hybridization kinetics. PNAs have larger 
5 changes in the melting temperature (Tm) for mismatched versus perfectly matched basepairs. DNA 
and RNA typically exhibit a 2-4*C drop in Tm for an internal mismatch. With the non-ionic PNA 
backbone, the drop is closer to 7-9°C. Similarly, due to their non-ionic nature, hybridization of the 
bases attached to these backbones is relatively insensitive to salt concentration. In addition, PNAs 
are not degraded by cellular enzymes, and thus can be more stable. 

1 0 The nucleic acids may be single stranded or double stranded, as specified, or contain portions of both 
double stranded or single stranded sequence. As will be appreciated by those in the art, the depiction 
of a single strand ("Watson") also defines the sequence of the other strand ("Crick"); thus the 
sequences described herein also includes the complement of the sequence. The nucleic acid may be 
DNA, both genomic and cDNA, RNA or a hybrid, where the nucleic acid contains any combination of 

15 deoxyribo- and ribo-nucleotides, and any combination of bases, including uracil, adenine, thymine, 
cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the 
term "nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non-naturally 
occurring analog structures. Thus for example the individual units of a peptide nucleic acid, each 

2 0 containing a base, are referred to herein as a nucleoside. 

An angiogenesis sequence can be initially identified by substantial nucleic acid and/or amino acid 
sequence homology to the angiogenesis sequences outlined herein. Such homology can be based 
upon the overall nucleic acid or amino acid sequence, and is generally determined as outlined below, 
using either homology programs or hybridization conditions. 

25 The angiogenesis screen included comparing genes identified in an in vitro model of angiogenesis as 
described in Hiraoka, Cell 95:365 (1998), which is expressly incorporated by reference, with genes 
identified in controls. Samples of normal tissue and tissue undergoing angiogenesis are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are commercially 

3 0 available, for example from Affymetrix. Gene expression profiles as described herein are generated 

and the data analyzed. 
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In a preferred embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, including, but not limited to 
lung, heart, brain, liver, breast, kidney, muscle, prostate, small intestine, large intestine, spleen, bone 
and placenta. In a prefen-ed embodiment, those genes identified during the angiogenesis screen that 
5 are expressed in any significant amount in other tissues are removed from the profile, although in 

some embodiments, this is not necessary. That is, when screening for drugs, it is preferable that the 
target be disease specific, to minimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are up-regulated in angiogenesis 
disorders; that is, the expression of these genes is higher in the disease tissue as compared to normal 

10 tissue. "Up-regulation" as used herein means at least about a two-fold change, preferably at least 
about a three fold change, with at least about five-fold or higher being preferred. All accession 
numbers herein are for the GenBank sequence database and the sequences of the accession 
numbers are hereby expressly incorporated by reference. GenBank Is known In the art, see, e.g., 
Benson, DA, et al., Nucleic Acids Research 26:1-7 (1998) and http://www.ncbi.nlm.nih.gov/. In 

15 addition, these genes were found to be expressed in a limited amount or not at all in heart, brain, lung, 
liver, breast, kidney, prostate, small intestine and spleen. 

In a preferred embodiment, angiogenesis sequences are those that are down-regulated in the 
angiogenesis disorder; that is, the expression of these genes is lower in angiogenic tissue as 
compared to normal tissue. "Down-regulation" as used herein means at least about a two-fold change, 
2 0 preferably at least about a three fold change, with at least about five-fold or higher being preferred. 

Angiogenesis sequences according to the invention may be classified into discrete clusters of 
sequences based on common expression profiles of the sequences. Expression levels of 
angiogenesis sequences may increase or decrease as a function of time In a manner that correlates 
with the induction of angiogenesis. Alternatively, expression levels of angiogenesis sequences may 

2 5 both increase and decrease as a function of time. For example, expression levels of some 

angiogenesis sequences are temporarily induced or diminished during the switch to the angiogenesis 
phenotype, followed by a return to baseline expression levels. Table 1 depicts 1774 genes, the 
expression of which varies as a function of time in angiogenesis tissue when compared to normal 
tissue. Figure 1 depicts 4 discrete expression profiles of angiogenesis genes identified in Table 1 . 

3 0 A particularly preferred embodiment includes the sequences as described in Table 2 which depicts a 

prefen-ed subset of 559 angiogenesis sequences, the expression of which is altered in angiogenesis 
when compared to normal tissue. 
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An additional embodiment includes the sequences as described in Table 3, which depicts 1916 genes 
including expression sequence tags (incorporated in their entirety here and throughout the application 
where Accession numbers are provided), whose expression levels change as a function of time in 
tissue undergoing angiogenesis compared to tissue that is not. 

5 A preferred embodiment includes the sequences as described in Table 4 which depicts a preferred 
subset of 558 genes identified in Table 3 whose expression levels change as a function of time in 
tissue undergoing angiogenesis compared to tissue that is not. 

A particularly preferred embodiment includes the sequences as deschbed in Table 5 which provides a 
preferred subset of 20 Accession numbers identified in Table 3 whose expression levels change as a 
1 0 function of time in tissue undergoing angiogenesis compared to tissue that is not. 

In a particularly preferred embodiment, angiogenesis sequences are those that are induced for a 
period of time followed by a retum to the baseline levels. Sequences that are temporarily induced 
provide a means to target angiogenesis tissue, for example neovascularized tumors, while avoiding 
rapidly growing tissue that require perpetual vascularization. Such positive angiogenic factors Include 

15 aFGF, bFGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are further categorized with respect to the timing of induction. 
For example, some angiogenesis genes may be induced at an early time period, such as with 10 
minutes of the induction of angiogenesis. Others may be induced later, such as between 5 and 60 
minutes, while yet others may be induced for a time period of about two hours or more followed by a 
2 0 return to baseline expression levels. 

In another preferred embodiment are angiogenesis sequences that are inhibited or reduced as a 
function of time followed by a return to "normal" expression levels. Inhibitors of angiogenesis are 
examples of molecules that have this expression profile. These sequences also can be further divided 
into groups depending on the timing of diminished expression. For example, some molecules may 
25 display reduced expression with 10 minutes of the induction of angiogenesis. Others may be 

diminished later, such as between 5 and 60 minutes, while others may be diminished for a time period 
of about two hours or more followed by a return to baseline. Examples of such negative angiogenic 
factors include thrombospondin and endostatin to name a few. 
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In yet another preferred embodiment are angiogenesis sequences that are induced for prolonged 
periods. These sequences are typically associated with induction of angiogenesis and may participate 
in induction and/or maintenance of the angiogenesis phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression of which Is reduced or 
5 diminished for prolonged periods In angiogenic tissue. These sequences are typically angiogenesis 
inhibitors and their diminution Is correlated with an increase in angiogenesis. 

Angiogenesis proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins or intracellular proteins. In a preferred embodiment the angiogenesis protein 
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the nucleos. 

1 0 Intracellular proteins are involved in all aspects of cellular function and replication (Including, for 
example, signaling pathways); aberrant expression of such proteins results in unregulated or 
disregulated cellular processes. For example, many Intracellular proteins have enzymatic activity such 
as protein kinase activity, protein phosphatase activity, protease activity, nucleotide cyclase activity, 
polymerase activity and the like. Intracellular proteins also serve as docking proteins that are involved 

15 in organizing complexes of proteins, or targeting proteins to various subcellular localizations, and are 
involved In maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing intracellular proteins is the presence in the 
proteins of one or more motifs for which defined functions have been attributed. In addition to the 
highly conserved sequences found In the enzymatic domain of proteins, highly conserved sequences 

2 0 have been Identified in proteins that are involved in protein-protein Interaction. For example, Src- 

homology-2 (SH2) domains bind tyroslne-phosphorylated targets In a sequence dependent manner. 
PTB domains, which are distinct from SH2 domains, also bind tyrosine phosphorylated targets. SH3 
domains bind to prollne-rich targets. In addition, PH domains, tetratrlcopeptide repeats and WD 
domains to name only a few, have been shown to mediate protein-protein interactions. Some of these 
25 may also be involved in binding to phospholipids or other second messengers. As will be appreciated 
by one of ordinary skill in the art, these motifs can be Identified on the basis of primary sequence; 
thus, an analysis of the sequence of proteins may provide Insight Into both the enzymatic potential of 
the molecule and/or molecules with which the protein may associate. 

In a preferred embodiment, the angiogenesis sequences are transmembrane proteins. 

3 0 Transmembrane proteins are molecules that span the phospholipid bilayer of a cell. They may have 

an Intracellular domain, an extracellular domain, or both. The Intracellular domains of such proteins 
may have a number of functions Including those already described for Intracellular proteins. For 
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example, the intracellular domain may have enzymatic activity and/or may serve as a binding site for 
additional proteins. Frequently the intracellular domain of transmembrane proteins serves both roles. 
For example certain receptor tyrosine kinases have both protein kinase activity and SH2 domains. In 
addition, autophosphorylation of tyrosines on the receptor molecule itself, creates binding sites for 
5 additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For example, 
receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases and receptor 
serine/threonine protein kinases contain a single transmembrane domain. However, various other 
proteins including channels and adenylyl cyclases contain numerous transmembrane domains. Many 

1 0 important cell surface receptors are classified as "seven transmembrane domain" proteins, as they 
contain 7 membrane spanning regions. Important transmembrane protein receptors include, but are 
not limited to insulin receptor, insulin-like grow^th factor receptor, human growth honnone receptor, 
glucose transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. IL-1 receptor, 

15 I L-2 receptor, etc. 

Characteristics of transmembrane domains include approximately 20 consecutive hydrophobic amino 
acids that may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains within the 
protein may be predicted. 

2 0 The extracellular domains of transmembrane proteins are diverse; however, conserved motifs are 
found repeatedly among various extracellular domains. Conserved structure and/or functions have 
been ascribed to different extracellular motifs. For example, cytokine receptors are characterized by a 
cluster of cysteines and a WSXWS (W= tryptophan, S= serine, X=any amino acid) motif. 
Immunoglobulin-like domains are highly conserved. Mucin-like domains may be involved in cell 

2 5 adhesion and leucine-rich repeats participate in protein-protein interactions. 

Many extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are receptors. Factors that bind the receptor domain include circulating ligands, which may 
be peptides, proteins, or small molecules such as adenosine and the like. For example, growth 
factors such as EGF, FGF and PDGF are circulating growth factors that bind to their cognate 

3 0 receptors to initiate a variety of cellular responses. Other factors include cytokines, mitogenic factors, 

neurotrophic factors and the like. Extracellular domains also bind to cell-associated molecules. In this 
respect, they mediate cell-cell interactions. Cell-associated ligands can be tethered to the cell for 
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example via a glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane 
proteins. Extracellular domains also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

Putative transmembrane angiogenesis proteins include those encoded by the sequences labeled with 
5 "Y' in the TM column depicted in Table 2. 

Angiogenesis proteins that are transmembrane are particularly preferred in the present invention as 
they are good targets for immunotherapeutics, as are described herein. In addition, as outlined below, 
transmembrane proteins can be also useful in imaging modalities. 

It will also be appreciated by those in the art that a transmembrane protein can be made soluble by 
1 0 removing transmembrane sequences, for example through recombinant methods. Furthermore, 
transmembrane proteins that have been made soluble can be made to be secreted through 
recombinant means by adding an appropriate signal sequence. 

In a preferred embodiment, the angiogenesis proteins are secreted proteins; the secretion of which 
can be either constitutive or regulated. These proteins have a signal peptide or signal sequence that 

15 targets the molecule to the secretory pathway. Secreted proteins are involved in numerous 

physiological events; by virtue of their circulating nature, they serve to transmit signals to various other 
cell types. The secreted protein may function in an autocrine manner (acting on the cell that secreted 
the factor), a paracrine manner (acting on cells in close proximity to the cell that secreted the factor) or 
an endocrine manner (acting on cells at a distance). Thus secreted molecules find use in modulating 

20 or altering numerous aspects of physiology. Angiogenesis proteins that are secreted proteins are 

particulariy preferred in the present invention as they serve as good targets for diagnostic markers, for 
example for blood tests. 

Putative secreted angiogenesis proteins include those encoded by the sequences depicted in Table 2 
that are labeled with "Y" in the SS column, but a "N" in the TM column. 

25 An angiogenesis sequence is initially identified by substantial nucleic acid and/or amino acid sequence 
homology to the angiogenesis sequences outlined herein. Such homology can be based upon the 
overall nucleic acid or amino acid sequence, and is generally determined as outlined below, using 
either homology programs or hybridization conditions. 



17 



wo 01/11086 



PCTAJSOO/22061 



As used herein, a nucleic acid is an "angiogenesls nucleic acid" if tlie overall liomology of the nucleic 
acid sequence to one of the nucleic acids of Table 1, Table 2, Table 3, Table 4 or Table 5 is preferably 
greater than about 75%, more preferably greater than about 80%, even more preferably greater than 
about 85% and most preferably greater than 90%. In some embodiments the homology will be as 
5 high as about 93 to 95 or 98%. Homology In this context means sequence similarity or Identity, with 
identity being prefen-ed. A prefen-ed comparison for homology purposes Is to compare the sequence 
containing sequencing errors to the correct sequence. This homology will be determined using 
standard techniques known in the art, including, but not limited to, the local homology algorithm of 
Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorith of Needleman 
10 & Wunsch, J. Mol. Biool. 48:443 (1970), by the search for similarity method of Pearson & Lipman, 
PNAS USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Drive, Madison, Wl), the Best Fit sequence program described by Devereux et al., Nucl. Acid 
Res. 12:387-395 (1984), preferably using the default settings, or by inspection. 

15 In a preferred embodiment, the sequences which are used to determine sequence identity or similarity 
are selected from the sequences set forth in the tables and figures, preferable those represented in 
Table 4, more preferably those represented in table 5, still more preferably those of Figures 2, 3, 7, 11 , 
12, 17, 21, 23 and fragments thereof. In one embodiment the sequences utilized herein are those set 
forth in the tables and figures. In another embodiment, the sequences are naturally occurring allelic 

2 0 variants of the sequences set forth In the tables and figures. In another embodiment, the sequences 

are sequence variants as further described herein. 

One example of a useful algorithm Is PILEUP. PILEUP creates a multiple sequence alignment from a 
group of related sequences using progressive, pairwise alignments. It can also plot a tree showing the 
clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive 
25 alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360 (1987); the method is similar to that 
described by Higgins & Sharp CABIOS 5:151-153 (1989). Useful PILEUP parameters including a 
default gap weight of 3.00, a default gap length weight of 0.10, and weighted end gaps. 

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J. Mol. Biol. 
215, 403-410, (1990) and Karlin et al., PNAS USA 90:5873-5787 (1993). A particularly useful BLAST 

3 0 program is the WU-BLAST-2 program which was obtained from Altschul et al.. Methods in 

Enzymology, 266; 460-480 (1996); http://blast.wustn . WU-BLAST-2 uses several search parameters, 
most of which are set to the default values. The adjustable parameters are set with the following 
values: overlap span =1, overiap fraction = 0.125, word threshold (T) = 11. The HSP S and HSP S2 
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parameters are dynamic values and are established by the program itself depending upon the 
composition of the particular sequence and composition of the particular database against which the 
sequence of interest is being searched; however, the values may be adjusted to increase sensitivity. 
A % amino acid sequence identity value is determined by the number of matching identical residues 
5 divided by the total number of residues of the "longer" sequence in the aligned region. The "longer" 
sequence is the one having the most actual residues in the aligned region (gaps introduced by WU- 
Blast-2 to maximize the alignment score are ignored). 

Thus, "percent (%) nucleic acid sequence identity" is defined as the percentage of nucleotide residues 
in a candidate sequence that are identical with the nucleotide residues of the nucleic acids of the 
1 0 figures. A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default 
parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively. 

The alignment may Include the introduction of gaps in the sequences to be aligned. In addition, for 
sequences which contain either more or fewer nucleotides than those of the nucleic acids of the 
figures, it is understood that the percentage of homology will be determined based on the number of 
1 5 homologous nucleosides in relation to the total number of nucleosides. Thus, for example, homology 
of sequences shorter than those of the sequences identified herein and as discussed below, will be 
determined using the number of nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through hybridization studies. Thus, for 
example, nucleic acids which hybridize under high stringency to the nucleic acids identified in the 
2 0 figures, or their complements, are considered an angiogenesis sequence. High stringency conditions 
are known in the art; see for example Maniatis et al.. Molecular Cloning: A Laboratory Manual, 2d 
Edition, 1989, and Short Protocols in Molecular Biology, ed. Ausubel, et al., both of which are hereby 
incorporated by reference. Stringent conditions are sequencendependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures. An 

2 5 extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry 

and Molecular Biology-Hybridization with Nucleic Acid Probes, "Overview of principles of hybridization 
and the strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to be 
about 5-1 0°C lower than the thennal melting point (Tm) for the specific sequence at a defined ionic 
strength pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid 

3 0 concentration) at which 50% of the probes complementary to the target hybridize to the target 

sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other salts) at pH 
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7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g. 1 0 to 50 nucleotides) and 
at least about 60°C for long probes (e.g. greater than 50 nucleotides). Stringent conditions may also 
be achieved with the addition of destabilizing agents such as fonnamide. 

In another embodiment, less stringent hybridization conditions are used; for example, moderate or low 
5 stringency conditions may be used, as are l<nown in the art; see Maniatis and Ausubel, supra, and 
Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention are fragments of larger genes, 
i.e. they are nucleic acid segments. "Genes" in this context includes coding regions, non-coding 
regions, and mixtures of coding and non-coding regions. Accordingly, as will be appreciated by those 
10 in the art, using the sequences provided herein, additional sequences of the angiogenesis genes can 
be obtained, using techniques well known in the art for cloning either longer sequences or the full 
length sequences; see Maniatis et al., and Ausubel, et al., supra, hereby expressly incorporated by 
reference. 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if necessary, its constituent 
15 parts recombined to form the entire angiogenesis nucleic acid. Once isolated from its natural source, 
e.g., contained within a plasmid or other vector or excised therefrom as a linear nucleic acid segment, 
the recombinant angiogenesis nucleic acid can be further-used as a probe to identify and Isolate other 
angiogenesis nucleic acids, for example additional coding regions. It can also be used as a 
"precursor' nucleic acid to make modified or variant angiogenesis nucleic acids and proteins. 

2 0 The angiogenesis nucleic acids of the present invention are used in several ways. In a first 

embodiment, nucleic acid probes to the angiogenesis nucleic acids are made and attached to biochips 
to be used in screening and diagnostic methods, as outlined below, or for administration, for example 
for gene therapy and/or antisense applications. Alternatively, the angiogenesis nucleic acids that 
include coding regions of angiogenesis proteins can be put into expression vectors for the expression 
25 of angiogenesis proteins, again either for screening purposes or for administration to a patient. 

In a prefen-ed embodiment, nucleic acid probes to angiogenesis nucleic acids (both the nucleic acid 
sequences outlined in the figures and/or the complements thereof) are made. The nucleic acid 
probes attached to the biochip are designed to be substantially complementary to the angiogenesis 
nucleic acids, i.e. the target sequence (either the target sequence of the sample or to other probe 

3 0 sequences, for example in sandwich assays), such that hybridization of the target sequence and the 

probes of the present invention occurs. As outlined below, this complementarity need not be perfect; 
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there may be any number of base pair mismatches which will interfere with hybridization between the 
target sequence and the single stranded nucleic acids of the present invention. However, if the 
number of mutations is so great that no hybridization can occur under even the least stringent of 
hybridization conditions, the sequence is not a complementary target sequence. Thus, by 
5 "substantially complementary" herein is meant that the probes are sufficiently complementary to the 

target sequences to hybridize under normal reaction conditions, particularly high stringency conditions, 
as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and partially double 
stranded. The strandedness of the probe is dictated by the structure, composition, and properties of 
10 the target sequence. In general, the nucleic acid probes range from about 8 to about 100 bases long, 
with from about 10 to about 80 bases being preferred, and from about 30 to about 50 bases being 
particularly preferred. That is, generally whole genes are not used. In some embodiments, much 
longer nucleic acids can be used, up to hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either overlapping 
15 probes or probes to different sections of the target being used. That is, two, three, four or more 
probes, with three being preferred, are used to build in a redundancy for a particular target. The 
probes can be overlapping (i.e. have some sequence in common), or separate. 

As will be appreciated by those in the art, nucleic acids can be attached or immobilized to a solid 
support in a wide variety of ways. By "immobilized" and grammatical equivalents herein is meant the 

2 0 association or binding between the nucleic acid probe and the solid support is sufficient to be stable 

under the conditions of binding, washing, analysis, and removal as outlined below. The binding can be 
covalent or non-covalent. By "non-covalent binding' and grammatical equivalents herein is meant one 
or more of either electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent 
binding is the covalent attachment of a molecule, such as, streptavidin to the support and the non- 
25 covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are attached by at 
least one bond, including sigma bonds, pi bonds and coordination bonds. Covalent bonds can be 
formed directly between the probe and the solid support or can be formed by a cross linker or by 
inclusion of a specific reactive group on either the solid support or the probe or both molecules. 

3 0 Immobilization may also involve a combination of covalent and non-covalent interactions. 
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In general, the probes are attached to the biochip in a wide variety of ways, as will be appreciated by 
those in the art. As described herein, the nucleic acids can either be synthesized first, with 
subsequent attachment to the biochip, or can be directly synthesized on the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or other 
5 grammatical equivalents herein is meant any material that can be modified to contain discrete 
individual sites appropriate for the attachment or association of the nucleic acid probes and is 
amenable to at least one detection method. As will be appreciated by those in the art, the number of 
possible substrates are very large, and include, but are not limited to, glass and modified or 
functional ized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other 

10 materials, polypropylene, polyethylene, polybutylene, polyurethanes, TeflonJ, etc.), polysaccharides, 
nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified silicon, 
carbon, metals, inorganic glasses, plastics, etc. In general, the substrates allow optical detection and 
do not appreciably fluorescese. A prefen-ed substrate is described in copending application entitled 
Reusable Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 

15 1999, herein incorporated by reference in its entirely. 

Generally the substrate is planar, although as will be appreciated by those in the art, other 
configurations of substrates may be used as well. For example, the probes may be placed on the 
inside surface of a tube, for flow-through sample analysis to minimize sample volume. Similarly, the 
substrate may be flexible, such as a flexible foam, including closed cell foams made of particular 
2 0 plastics. 

In a prefened embodiment, the surface of the biochip and the probe may be derivatized with chemical 
functional groups for subsequent attachment of the two. Thus, for example, the biochip is derivatized 
with a chemical functional group including, but not limited to, amino groups, carboxy groups, oxo 
groups and thiol groups, with amino groups being particularly preferred. Using these functional 

2 5 groups, the probes can be attached using functional groups on the probes. For example, nucleic 

acids containing amino groups can be attached to surfaces comprising amino groups, for example 
using linkers as are known in the art; for example, homo-or hetero-bifunctional linkers as are well 
known (see 1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 
155-200, incorporated herein by reference). In addition, in some cases, additional linkers, such as 

3 0 alkyi groups (including substituted and heteroalkyi groups) may be used. 
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In this embodiment, tlie oligonucleotides are synthesized as is known in the art, and then attached to 
the surface of the solid support. As will be appreciated by those skilled in the art, either the 5' or 3' 
terminus may be attached to the solid support, or attachment may be via an internal nucleoside. 

In an additional embodiment, the immobilization to the solid support may be very strong, yet non- 
5 covalent. For example, biotinylated oligonucleotides can be made, which bind to surfaces covalently 
coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in the art. For 
example, photoactivation techniques utilizing photopolymerization compounds and techniques are 
used. In a preferred embodiment, the nucleic acids can be synthesized in situ, using well known 
10 photolithographic techniques, such as those described in WO 95/25116; WO 95/35505; U.S. Patent 
Nos. 5,700,637 and 5,445,934; and references cited within, all of which are expressly incorporated by 
reference; these methods of attachment form the basis of the Affimetrix GeneChip™ technology. 

In a preferred embodiment, angiogenesis nucleic acids encoding angiogenesis proteins are used to 
make a variety of expression vectors to express angiogenesis proteins which can then be used in 

15 screening assays, as described below. The expression vectors may be either self-replicating 
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
expression vectors include transcriptional and translational regulatory nucleic acid operably linked to 
the nucleic acid encoding the angiogenesis protein. The term "control sequences" refers to DNA 
sequences necessary for the expression of an operably linked coding sequence in a particular host 

2 0 organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, 
optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to utilize 
promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 

2 5 acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA 

for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; 
a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the 
sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are 

3 0 contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, 

enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction 
sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in 
accordance with conventional practice. The transcriptional and translational regulatory nucleic acid 
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will generally be appropriate to the host cell used to express the angiogenesis protein; for example, 
transcriptional and translational regulatory nucleic acid sequences from Bacillus are preferably used to 
express the angiogenesis protein in Bacillus. Numerous types of appropriate expression vectors, and 
suitable regulatory sequences are known in the art for a variety of host cells. 

5 In general, the transcriptional and translational regulatory sequences may include, but are not limited 
to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, 
translational start and stop sequences, and enhancer or activator sequences. In a preferred 
embodiment, the regulatory sequences include a promoter and transcriptional start and stop 
sequences. 

1 0 Promoter sequences encode either constitutive or inducible promoters. The promoters may be either 
naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine elements of 
more than one promoter, are also known in the art, and are useful in the present invention. 

In addition, the expression vector may comprise additional elements. For example, the expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, for 

15 example in mammalian or insect cells for expression and in a procaryotic host for cloning and 

amplification. Furthermore, for integrating expression vectors, the expression vector contains at least 
one sequence homologous to the host cell genome, and preferably two homologous sequences which 
flank the expression construct. The integrating vector may be directed to a specific locus in the host 
cell by selecting the appropriate homologous sequence for inclusion in the vector. Constructs for 

2 0 integrating vectors are well known in the art. 

In addition, in a preferred embodiment, the expression vector contains a selectable marker gene to 
allow the selection of transformed host cells. Selection genes are well known in the art and will vary 
with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a host cell transformed 

2 5 with an expression vector containing nucleic acid encoding an angiogenesis protein, under the 

appropriate conditions to induce or cause expression of the angiogenesis protein. The conditions 
appropriate for angiogenesis protein expression will vary with the choice of the expression vector and 
the host cell, and will be easily ascertained by one skilled in the art through routine experimentation. 
For example, the use of constitutive promoters in the expression vector will require optimizing the 

3 0 growth and proliferation of the host cell, while the use of an inducible promoter requires the 

appropriate growth conditions for induction. In addition, in some embodiments, the timing of the 

24 
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harvest is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and animal cells, 
including mammalian cells. Of particular interest are Drosophila melangaster cells, Saccharomyces 
5 cerevisiae and other yeasts, E. coli. Bacillus subtilis, Sf9 cells, C1 29 cells, 293 cells, Neurospora, 
BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells),THP1 cells (a 
macrophage cell line) and human cells and lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in mammalian cells. Mammalian 
expression systems are also l<nov\'n in the art, and include retroviral systems. A preferred expression 

1 0 vector system is a retroviral vector system such as is generally described in PCT/US97/01 01 9 and 

PCT/US97/01048, both of which are hereby expressly incorporated by reference. Of particular use as 
mammalian promoters are the promoters from mammalian viral genes, since the viral genes are often 
highly expressed and have a broad host range. Examples include the SV40 early promoter, mouse 
mammary tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 

15 and the CMV promoter. Typically, transcription termination and polyadenylation sequences 

recognized by mammalian cells are regulatory regions located 3' to the translation stop codon and 
thus, together with the promoter elements, flank the coding sequence. Examples of transcription 
terminator and polyadenlytion signals include those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other hosts, is 
2 0 well known in the art, and will vary with the host cell used. Techniques include dextran-mediated 
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, and direct 
microinjection of the DNA into nuclei. 

In a preferred embodiment, angiogenesis proteins are expressed in bacterial systems. Bacterial 
25 expression systems are well known in the art. Promoters from bacteriophage may also be used and 
are known in the art. In addition, synthetic promoters and hybrid promoters are also useful; for 
example, the tac promoter is a hybrid of the trp and lac promoter sequences. Furthermore, a bacterial 
promoter can include naturally occuning promoters of non-bacterial origin that have the ability to bind 
bacterial RNA polymerase and initiate transcription. In addition to a functioning promoter sequence, 
30 an efficient ribosome binding site is desirable. The expression vector may also include a signal 

peptide sequence that provides for secretion of the angiogenesis protein in bacteria. The protein is 
either secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
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between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial expression 
vector may also include a selectable marker gene to aWow for the selection of bacterial strains that 
have been transformed. Suitable selection genes include genes which render the bacteria resistant to 
drugs such as ampicillln, chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. 
5 Selectable markers also include biosynthetic genes, such as those in the histidine, tryptophan and 
leucine biosynthetic pathways. These components are assembled Into expression vectors. 
Expression vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E. 
coli, Streptococcus cremoris, and Streptococcus lividans, among others. The bacterial expression 
vectors are transformed into bacterial host cells using techniques well known In the art, such as 
1 0 calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. Expression vectors for the 
transformation of Insect cells, and in particular, baculovirus-based expression vectors, are well known 
In the art. 

In a preferred embodiment, angiogenesis protein Is produced In yeast cells. Yeast expression 
15 systems are well known in the art, and include expression vectors for Saccharomyces cerevisiae, 

Candida albicans and C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia 
guillerimondii and P. pastoris, Schizosaccharomyces pombe, and Yarrowia lipolyt'ca. 

The angiogenesis protein may also be made as a fusion protein, using techniques well known in the 
art. Thus, for example, for the creation of monoclonal antibodies. If the desired epitope is small, the 
2 0 angiogenesis protein may be fused to a carrier protein to form an immunogen. Alternatively, the 

angiogenesis protein may be made as a fusion protein to Increase expression, or for other reasons. 
For example, when the angiogenesis protein Is an angiogenesis peptide, the nucleic acid encoding the 
peptide may be linked to other nucleic acid for expression purposes. 

In one embodiment, the angiogenesis nucleic acids, proteins and antibodies of the invention are 

2 5 labeled. By "labeled" herein is meant that a compound has at least one element, isotope or chemical 

compound attached to enable the detection of the compound. In general, labels fall into three classes: 
a) isotopic labels, which may be radioactive or heavy isotopes; b) immune labels, which may be 
antibodies or antigens; and c) colored or fluorescent dyes. The labels may be incorporated into the 
angiogenesis nucleic acids, proteins and antibodies at any position. For example, the label should be 

3 0 capable of producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 

radioisotope, such as 'H, '''C, ^^P, ^^S, or '^^1, a fluorescent or chemiluminescent compound, such as 
fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta- 
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galactosidase or horseradish peroxidase. Any method l<nown in the art for conjugating the antibody to 
the label may be employed, including those methods described by Hunter et al., Nature . 144 :945 
(1962); David et al., Biochemistry. 13:1014 (1974); Pain et al., J. Immunol. Math. . 40:219 (1981); and 
Nygren, J. Histochem. and Cvtochem. . 30:407 (1982). 

5 Accordingly, the present invention also provides angiogenesis protein sequences. An angiogenesis 
protein of the present invention may be identified in several ways. "Protein" in this sense includes 
proteins, polypeptides, and peptides. As will be appreciated by those in the art, the nucleic acid 
sequences of the invention can be used to generate protein sequences. There are a variety of ways 
to do this, including cloning the entire gene and verifying its frame and amino acid sequence, or by 

1 0 comparing it to known sequences to search for homology to provide a frame, assuming the 

angiogenesis protein has homology to some protein in the database being used. Generally, the 
nucleic acid sequences are Input into a program that will search all three frames for homology. This is 
done in a preferred embodiment using the following NCBI Advanced BLAST parameters. The program 
is blastx or blastn. The database is nr. The input data is as "Sequence in FASTA format". The 

15 organism list is "none". The "expect" is 10; the filter is default. The "descriptions" is 500, the 

"alignments" is 500, and the "alignment view" is pairwise. The "Query Genetic Codes" is standard (1 ). 
The matrix is BLOSUM62; gap existence cost is 1 1 , per residue gap cost is 1 ; and the lambda ratio is 
.85 default. This results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid variants of the naturally 
2 0 occurring sequences, as determined herein. Preferably, the variants are preferably greater than about 
75% homologous to the wild-type sequence, more preferably greater than about 80%, even more 
preferably greater than about 85% and most preferably greater than 90%. In some embodiments the 

homology will be as high as about 93 to 95 or 98%. As for nucleic acids, homology in this context 
means sequence similarity or identity, with identity being preferred. This homology will be determined 

2 5 using standard techniques known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than the wild type amino acid 
sequences. Thus, in a preferred embodiment, included within the definition of angiogenesis proteins 
are portions or fragments of the wild type sequences, herein. In addition, as outlined above, the 
angiogenesis nucleic acids of the invention may be used to obtain additional coding regions, and thus 

3 0 additional protein sequence, using techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant angiogenesis proteins 
as compared to the wild-type sequence. That is, as outlined more fully below, the derivative 
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angiogenesis peptide will contain at least one amino acid substitution, deletion or insertion, with amino 
acid substitutions being particularly prefen-ed. The amino acid substitution, insertion or deletion may 
occur at any residue within the angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present invention are amino acid 
5 sequence variants. These variants fall into one or more of three classes: substitutional, insertional or 
deletional variants. These variants ordinarily are prepared by site specific mutagenesis of nucleotides 
in the DNA encoding the angiogenesis protein, using cassette or PGR mutagenesis or other 
techniques well known in the art, to produce DNA encoding the variant, and thereafter expressing the 
DNA in recombinant cell culture as outlined above. However, variant angiogenesis protein fragments 

10 having up to about 100-150 residues may be prepared by in vitro synthesis using established 

techniques. Amino acid sequence variants are characterized by the predetermined nature of the 
variation, a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected which 

1 5 have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, in order to optimize the performance of a 
mutation at a given site, random mutagenesis may be conducted at the target codon or region and the 
expressed angiogenesis variants screened for the optimai combination of desired activity. Techniques 
2 0 for making substitution mutations at predetermined sites in DNA having a known sequence are well 
known, for example, Ml 3 primer mutagenesis and PGR mutagenesis. Screening of the mutants is 
done using assays of angiogenesis protein activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on the order of from 
about 1 to 20 amino acids, although considerably larger insertions may be tolerated. Deletions range 

2 5 from about 1 to about 20 residues, although in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final 
derivative. Generally these changes are done on a few amino acids to minimize the alteration of the 
molecule. However, larger changes may be tolerated in certain circumstances. When small 
alterations in the characteristics of the angiogenesis protein are desired, substitutions are generally 

3 0 made in accordance with the following chart: 
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Chart I 



Original Residue 



Exemplary Substitutions 



Ala 

Arg 

Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 

lie 

Leu 

Lys 

Met 

Phe 

Ser 

Thr 



Ser 
Lys 



5 



Gin, His 



Glu 
Ser 
Asn 



Asp 



10 



Pro 



15 



Asn, Gin 
Leu, Vai 
lie, Val 
Arg, Gin, 
Leu, lie 
Met, Leu, 
Thr 



Glu 



Tyr 



Trp 
Tyr 
Val 



Ser 
Tyr 



20 



Trp, Phe 
lie, Leu 



Substantial changes in function or immunologicai identity are made by selecting substitutions that are 
less conservative than those shown in Chart I. For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for example 

2 5 the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the molecule at the target 

site; or the bull< of the side chain. The substitutions which in general are expected to produce the 
greatest changes in the polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl 
orthreonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylaianyl, vaiyi or 
alanyl; (b) a cysteine or praline is substituted for (or by) any other residue: (c) a residue having an 

3 0 electropositive side chain, e.g. iysyl, arginyl, or histidyl, is substituted for (or by) an electronegative 

residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will elicit the same immune 
response as the naturally-occurring analogue, although variants also are selected to modify the 
3 5 characteristics of the angiogenesis proteins as needed. Alternatively, the variant may be designed 

such that the biological activity of the angiogenesis protein is altered. For example, glycosylation sites 
may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the scope of this invention. 
One type of covalent modification includes reacting targeted amino acid residues of an angiogenesis 
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polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or 
the N-or C-terminal residues of an angiogenesis polypeptide. Derivatization with bifunctional agents is 
useful, for instance, for crosslinking angiogenesis polypeptides to a water-insoluble support matrix or 
surface for use in the method for purifying anti-angiogenesis polypeptide antibodies or screening 
5 assays, as is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters 
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithlobis(succinimidylpropionate), bifunctional malelmides such as bis-N-malelmido-1,8-octane and 
agents such as methyl-3-[(p-azidophenyl)dithia]propioimidate. 

10 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and asparty! residues, respectively, hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl, threonyl or tyrosyl residues, methylation of the a-amino groups of lysine, 
arginine, and histidine side chains U.E. Creighton, Proteins: Structure and Molecular Properties, W.H. 
15 Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation 
of any C-terminal carboxyl group. 

Another type of covalent modification of the angiogenesis polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
2 0 glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate 

moieties found In native sequence angiogenesis polypeptide, and/or adding one or more glycosylation 
sites that are not present in the native sequence angiogenesis polypeptide. 

Addition of glycosylation sites to angiogenesis polypeptides may be accomplished by altering the 

2 5 amino acid sequence thereof. The alteration may be made, for example, by the addition of, or 

substitution by, one or more serine or threonine residues to the native sequence angiogenesis 
polypeptide (for O-linked glycosylation sites). The angiogenesis amino acid sequence may optionally 
be altered through changes at the DNA level, particularly by mutating the DNA encoding the 
angiogenesis polypeptide at preselected bases such that codons are generated that will translate into 

3 0 the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the angiogenesis polypeptide is 
by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in 
the art, e.g., in WO 87/05330 published 1 1 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 
Biochem., pp. 259-306 (1981). 
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Removal of carbohydrate moieties present on the angiogenesis polypeptide may be accomplished 
chemically or enzymatically or by mutational substitution of codons encoding for amino acid residues 
that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and 
described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge at 
5 al., Anal. Biochem., 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides 
can be achieved by the use of a variety of endo-and exo-glycosidases as described by Thotakura et 
al., Meth. Enzymol., 138:350 (1987). 

Another type of covalent modification of angiogenesis comprises linking the angiogenesis polypeptide 
to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or 
10 polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 
4,670,417; 4,791,192 or4,179,337. 

Angiogenesis polypeptides of the present invention may also be modified in a way to form chimeric 
molecules comprising an angiogenesis polypeptide fused to another, heterologous polypeptide or 
amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of an 

15 angiogenesis polypeptide with a tag polypeptide which provides an epitope to which an anti-tag 

antibody can selectively bind. The epitope tag is generally placed at the amino-or carboxyl-terminus of 
the angiogenesis polypeptide. The presence of such epitope-tagged forms of an angiogenesis 
polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the 
epitope tag enables the angiogenesis polypeptide to be readily purified by affinity purification using an 

2 0 anti-tag antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of an angiogenesis polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric 
molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the art. Examples include 
25 poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its 
antibody 12CA5 [Field etal., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 
6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology, 5:3610-3616 
(1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et 
30 al., BioTechnology. 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al.. Science, 255:192-194 
(1992)]; tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 
gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6**397 
(1990)]. 
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Also included with an embodiment of angiogenesis protein are other angiogenesis proteins of the 
angiogenesis family, and angiogenesis proteins from other organisms, which are cloned and 
expressed as outlined below. Thus, probe or degenerate polymerase chain reaction (PGR) primer 
sequences may be used to find other related angiogenesis proteins from humans or other organisms. 
5 As will be appreciated by those in the art, particularly useful probe and/or PGR primer sequences 

include the unique areas of the angiogenesis nucleic acid sequence. As is generally l<nown in the art, 
preferred PGR primers are from about 15 to about 35 nucleotides in length, with from about 20 to 
about 30 being preferred, and may contain inosine as needed. The conditions for the PGR reaction 
are well known in the art. 

10 In addition, as is outlined herein, angiogenesis proteins can be made that are longer than those 

encoded by the nucleic acids of the figures, for example, by the elucidation of additional sequences, 
the addition of epitope or purification tags, the addition of other fusion sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by angiogenesis nucleic acids. Thus, 
angiogenesis proteins are encoded by nucleic acids that will hybridize to the sequences of the 
1 5 sequence listings, or their complements, as outlined herein. 

In a preferred embodiment, when the angiogenesis protein is to be used to generate antibodies, for 
example for immunotherapy, the angiogenesis protein should share at least one epitope or 
determinant with the full length protein. By "epitope" or "determinant" herein is meant a portion of a 
protein which will generate and/or bind an antibody or T-cell receptor in the context of MHC. Thus, in 
2 0 most instances, antibodies made to a smaller angiogenesis protein will be able to bind to the full 

length protein. In a preferred embodiment, the epitope is unique; that is, antibodies generated to a 
unique epitope show little or no cross-reactivity. In a preferred embodiment, the epitope is selected 
from AAA4p1 and AAA4p2. In another preferred embodiment the epitope is selected from AAA1 pi 
and AAA1 p2. In another prefen-ed embodiment the epitope is selected from AAA7p1 , AAA7p2, 

2 5 AAA7p3 and AAA7p1 m. 

In one embodiment, the term "antibody" includes antibody fragments, as are l<nown in the art, 
including Fab, Fabj, single chain antibodies (Fv for example), chimeric antibodies, etc., either 
produced by the modification of whole antibodies or those synthesized de novo using recombinant 
DNA technologies. 

3 0 Methods of preparing polyclonal antibodies are l<nown to the skilled artisan. Polyclonal antibodies can 

be raised in a mammal, for example, by one or more injections of an immunizing agent and, if desired. 
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an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal by 
multiple subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It may be 
useful to conjugate the Immunizing agent to a protein known to be immunogenic in the mammal being 
5 Immunized. Examples of such immunogenic proteins include but are not limited to keyhole limpet 
hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of 
adjuvants which may be employed Include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The Immunization protocol may be 
selected by one skilled in the art without undue experimentation. 

10 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be prepared 
using hybridoma methods, such as those described by Kohler and Milstein, Nature . 256:495 (1975). 
In a hybridoma method, a mouse, hamster, or other appropriate host animal. Is typically Immunized 
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the Immunizing agent. Alternatively, the lymphocytes may be Immunized in 

1 5 vitro. The immunizing agent will typically include a polypeptide encoded by a nucleic acid of Table 1 , 
Table 2, Table 3, Table 4 or Table 5 or fragment thereof or a fusion protein thereof. Generally, either 
peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or 
lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to fomn 

20 a hybridoma cell [Coding, Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 
59-103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more 
substances that inhibit the growth or survival of the unfused, Immortalized cells. For example, If the 

2 5 parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), 

the culture medium for the hybridomas typically will Include hypoxanthine, aminopterin, and thymidine 
("HAT medium"), which substances prevent the growth of HGPRT-deflclent cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are monoclonal, 
preferably human or humanized, antibodies that have binding specificities for at least two different 

3 0 antigens. In the present case, one of the binding specificities Is for a protein encoded by a nucleic 

acid of figure 1 or 3-6 or a fragment thereof, the other one Is for any other antigen, and preferably for a 
cell-surface protein or receptor or receptor subunit, preferably one that is tumor specific. 
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In a preferred embodiment, the antibodies to angiogenesis protein are capable of reducing or 
eliminating the biological function of angiogenesis protein, as is described below. That is, the addition 
of anti-angiogenesis protein antibodies (either polyclonal or preferably monoclonal) to angiogenic 
tissue (or cells containing angiogenesis) may reduce or eliminate the angiogenesis activity. Generally, 
5 at least a 25% decrease in activity is preferred, with at least about 50% being particularly preferred 
and about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are humanized antibodies. 
Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab", F(ab')2 or other antigen-binding 

10 subsequences of antibodies) w/hich contain minimal sequence derived from non-human 

immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in vi^hich 
residues form a complementary determining region (CDR) of the recipient are replaced by residues 
from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired 
specificity, affinity and capacity. In some instances, Fv framework residues of the human 

15 immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies may also 
comprise residues which are found neither in the recipient antibody nor in the imported CDR or 
framework sequences. In general, the humanized antibody will comprise substantially all of at least 
one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond 
to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of a 

20 human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin 
[Jones et al., Nature , 321:522-525 (1986); Riechmann et al., Nature . 332:323-329 (1988); and Presta, 
Curr. Op. Struct. Biol. . 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 

25 antibody has one or more amino acid residues introduced into it from a source which is non-human. 
These non-human amino acid residues are often referred to as import residues, which are typically 
taken from an import variable domain. Humanization can be essentially performed following the 
method of Winter and co-workers [Jones et al., Nature , 321:522-525 (1 986); Riechmann et al., Nature , 
332:323-327 (1988); Verhoeyen et al.. Science. 239:1534-1536 (1988)], by substituting rodent CDRs 

3 0 or CDR sequences for the con^sponding sequences of a human antibody. Accordingly, such 

humanized antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less 
than an intact human variable domain has been substituted by the corresponding sequence from a 
non-human species. In practice, humanized antibodies are typically human antibodies in which some 
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CDR residues and possibly some FR residues are substituted by residues from analogous sites in 
rodent antibodies. 

Human antibodies can aiso be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol. . 227:381 (1991 ); Marl<s et al.. J. Mol. Biol. . 
5 222:581 (1991 )]. The techniques of Coie et al. and Boerner et al. are also available for the preparation 
of human monocionai antibodies (Coie et al., Monoclonal Antibodies and Cancer Therapy . Alan R. 
Liss, p. 77 (1985) and Boerner et al., J. Immunol. . 147(1 ) :86-95 (1991)]. Similarly, human antibodies 
can be made by introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which 
the endogenous immunoglobulin genes have been partially or completely inactivated. Upon 

1 0 challenge, human antibody production is observed, which closely resembles that seen in humans in all 
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825: 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marl<s et al.. Bio/Technology 10. 779-783 
(1992); Lonberg et al.. Nature 368 856-859 (1994); Morrison. Nature 368 . 812-13 (1994); Fishwild et 

15 al., Nature Biotechnology 14. 845-51 (1996); Neuberoer. Nature Biotechnology 14. 826 (1996); 
Lonberg and Huszar. Intern. Rev. Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised against angiogenesis 
proteins. As used herein, immunotherapy can be passive or active. Passive immunotherapy as 
defined herein is the passive transfer of antibody to a recipient (patient). Active immunization is the 

2 0 induction of antibody and/or T-cell responses in a recipient (patient). Induction of an immune 

response is the result of providing the recipient with an antigen to which antibodies are raised. As 
appreciated by one of ordinary skill in the art, the antigen may be provided by injecting a polypeptide 
against which antibodies are desired to be raised into a recipient, or contacting the recipient with a 
nucleic acid capable of expressing the antigen and under conditions for expression of the antigen. 

25 In a preferred embodiment the angiogenesis proteins against which antibodies are raised are secreted 
proteins as described above. Without being bound by theory, antibodies used for treatment, bind and 
prevent the secreted protein from binding to its receptor, thereby inactivating the secreted 
angiogenesis protein. 

In another preferred embodiment, the angiogenesis protein to which antibodies are raised is a 

3 0 transmembrane protein. Without being bound by theory, antibodies used for treatment, bind the 

extracellular domain of the angiogenesis protein and prevent it from binding to other proteins, such as 
circulating ligands or cell-associated molecules. The antibody may cause down-regulation of the 
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transmembrane angiogenesis protein. As will be appreciated by one of ordinary skill in the art, the 
antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding to the 
extracellular domain of the angiogenesis protein. The antibody is also an antagonist of the 
angiogenesis protein. Further, the antibody prevents activation of the transmembrane angiogenesis 
5 protein. In one aspect, when the antibody prevents the binding of other molecules to the angiogenesis 
protein, the antibody prevents growth of the cell. The antibody also sensitizes the cell to cytotoxic 
agents, including, but not limited to TNF-a, TNF-p, IL-1, INF-y and IL-2, or ciiemotherapeutic agents 
including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, and the like. In some instances the 
antibody belongs to a sub-type that activates serum complement when complexed with the 
10 transmembrane protein thereby mediating cytotoxicity, Thus, angiogenesis is treated by administering 
to a patient antibodies directed against the transmembrane angiogenesis protein. 

In another preferred embodiment, the antibody is conjugated to a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the angiogenesis protein. In 
another aspect the therapeutic moiety modulates the activity of molecules associated with or in close 
15 proximity to the angiogenesis protein. The therapeutic moiety may inhibit enzymatic activity such as 
protease or collagenase activity associated with angiogenesis. 

In a preferred embodiment, the therapeutic moiety may also be a cytotoxic agent. In this method, 
targeting the cytotoxic agent to angiogenesis tissue or cells, results in a reduction In the number of 
afflicted cells, thereby reducing symptoms associated with angiogenesis. Cytotoxic agents are 

2 0 numerous and varied and include, but are not limited to, cytotoxic drugs or toxins or active fragments 
of such toxins. Suitable toxins and their corresponding fragments include diptheria A chain, exotoxin 
A chain, ricin A chain, abrin A chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic 
agents also include radiochemicals made by conjugating radioisotopes to antibodies raised against 
angiogenesis proteins, or binding of a radionuclide to a chelating agent that has been covalently 

2 5 attached to the antibody. Targeting the therapeutic moiety to transmembrane angiogenesis proteins 
not only serves to increase the local concentration of therapeutic moiety in the angiogenesis afflicted 
area, but also serves to reduce deleterious side effects that may be associated with the therapeutic 
moiety. 

In another preferred embodiment, the angiogenesis protein against which the antibodies are raised is 
30 an intracellular protein. In this case, the antibody may be conjugated to a protein which facilitates 

entry into the cell. In one case, the antibody enters the cell by endocytosis. In another embodiment, a 
nucleic acid encoding the antibody is administered to the individual or cell. Moreover, wherein the 
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angiogenesis protein can be targeted within a cell, i.e., the nucleus, an antibody thereto contains a 
signal for that target localization, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis proteins. By 
"specifically bind" herein is meant that the antibodies bind to the protein with a binding constant in the 
5 range of at least 1 0"*- 1 0"® M'^ , with a preferred range being 1 0"' - 1 0"^ M"^ . 

In a prefen-ed embodiment, the angiogenesis protein is purified or isolated after expression. 
Angiogenesis proteins may be isolated or purified in a variety of ways known to those skilled in the art 
depending on what other components are present in the sample. Standard purification methods 
include electrophoretic, molecular, immunological and chromatographic techniques, including ion 

10 exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and chromatofocusing. 
For example, the angiogenesis protein may be purified using a standard anti-angiogenesis protein 
antibody column. Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, 
are also useful. For general guidance in suitable purification techniques, see Scopes, R., Protein 
Purification, Springer- Veriag, NY (1982). The degree of purification necessary will vary depending on 

15 the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and nucleic acids are useful in a 
number of applications. 

In one aspect, the expression levels of genes are determined for different cellular states in the 
angiogenesis phenotype; that is, the expression levels of genes in normal tissue (i.e. not undergoing 

2 0 angiogenesis) and in angiogenesis tissue (and in some cases, for varying severities of angiogenesis 
that relate to prognosis, as outlined below) are evaluated to provide expression profiles. An 
expression profile of a particular cell state or point of development is essentially a "fingerprint" of the 
state; while two states may have any particular gene similariy expressed, the evaluation of a number 
of genes simultaneously allows the generation of a gene expression profile that is unique to the state 

2 5 of the cell. By comparing expression profiles of cells in different states, information regarding which 
genes are important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be done or confirmed: does tissue from a particular patient have the 
gene expression profile of normal or angiogenesis tissue. 

"Differential expression," or grammatical equivalents as used herein, refers to both qualitative as well 
30 as quantitative differences in the genes' temporal and/or cellular expression patterns within and 

among the cells. Thus, a differentially expressed gene can qualitatively have its expression altered, 
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including an activation or inactivation, in, for example, normal versus angiogenic tissue. Tliat is, genes 
may be turned on or turned off in a particular state, relative to another state. As is apparent to ttie 
sl<illed artisan, any comparison of two or more states can be made. Sucti a qualitatively regulated 
gene will extiibit an expression pattern within a state or cell type which is detectable by standard 
5 techniques in one such state or cell type, but is not detectable in both. Alternatively, the detemiination 
is quantitative in that expression is increased or decreased; that is, the expression of the gene is either 
upregulated, resulting in an increased amount of transcript, ordownregulated, resulting in a decreased 
amount of transcript. The degree to which expression differs need only be large enough to quantify 
via standard characterization techniques as outlined below, such as by use of Affymetrix GeneChip™ 

10 expression arrays, Lockhart, Nature Biotechnology, 14:1675-1680 (1996), hereby expressly 

incorporated by reference. Other techniques include, but are not limited to, quantitative reverse 
transcriptase PGR, Northern analysis and RNase protection. As outlined above, preferably the change 
in expression (i.e. upregulation or downregulation) is at least about 50%, more preferably at least 
about 100%, more preferably at least about 150%, more preferably, at least about 200%, with from 

1 5 300 to at least 1 000% being especially prefen-ed. 

As will be appreciated by those in the art, this may be done by evaluation at either the gene transcript, 
or the protein level; that is, the amount of gene expression may be monitored using nucleic acid 
probes to the DNA or RNA equivalent of the gene transcript, and the quantification of gene expression 
levels, or, alternatively, the final gene product itself (protein) can be monitored, for example through 
2 0 the use of antibodies to the angiogenesis protein and standard immunoassays (ELISAs, etc.) or other 
techniques. Including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Thus, the 
proteins corresponding to angiogenesis genes, i.e. those identified as being important in an 
angiogenesis phenotype, can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. an 
25 expression profile, is monitored simultaneously, although multiple protein expression monitoring can 
be done as well. Similariy, these assays may be done on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to biochips as outlined herein 
for the detection and quantification of angiogenesis sequences in a particular cell. The assays are 
further described below in the example. 

30 In a preferred embodiment nucleic acids encoding the angiogenesis protein are detected. Although 
DNA or RNA encoding the angiogenesis protein may be detected, of particular interest are methods 
wherein the mRNA encoding an angiogenesis protein is detected. The presence of mRNA in a 
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sample is an indication that the angiogenesis gene has been transcribed to form the mRNA, and 
suggests that the protein is expressed. Probes to detect the mRNA can be any 
nucleotide/deoxynucleotide probe that is complementary to and base pairs with the mRNA and 
includes but is not limited to oligonucleotides, cDNA or RNA. Probes also should contain a detectable 
5 label, as defined herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the sample. 
Following washing to remove the non-specifically bound probe, the label is detected. In another 
method detection of the mRNA is performed in situ. In this method permeabilized cells or tissue 
samples are contacted with a detectably labeled nucleic acid probe for sufficient time to allow the 
10 probe to hybridize with the target mRNA. Following washing to remove the non-specifically bound 
probe, the label is detected. For example a digoxygenin labeled riboprobe (RNA probe) that is 
complementary to the mRNA encoding an angiogenesis protein is detected by binding the digoxygenin 
with an anti-digoxygenin secondary antibody and developed with nitro blue tetrazolium and 
5-bromo-4-chloro-3-indoyl phosphate. 

15 In a preferred embodiment, any of the three classes of proteins as described herein (secreted, 

transmembrane or intracellular proteins) are used in diagnostic assays. The angiogenesis proteins, 
antibodies, nucleic acids, modified proteins and cells containing angiogenesis sequences are used in 
diagnostic assays. This can be done on an individual gene or corresponding polypeptide level. In a 
preferred embodiment, the expression profiles are used, preferably in conjunction with high throughput 

2 0 screening techniques to allow monitoring for expression profile genes and/or corresponding 

polypeptides. 

As described and defined herein, angiogenesis proteins, including intracellular, transmembrane or 
secreted proteins, find use as markers of angiogenesis. Detection of these proteins in putative 
angiogenesis tissue or patients allows for a determination or diagnosis of angiogenesis. Numerous 
25 methods known to those of ordinary skill in the art find use in detecting angiogenesis. In one 

embodiment, antibodies are used to detect angiogenesis proteins. A preferred method separates 
proteins from a sample or patient by electrophoresis on a gel (typically a denaturing and reducing 
protein gel, but may be any other type of gel including isoelectric focusing gels and the like). Following 
separation of proteins, the angiogenesis protein is detected by immunoblotting with antibodies raised 

3 0 against the angiogenesis protein. Methods of immunoblotting are well known to those of ordinary skill 

in the art. 

In another preferred method, antibodies to the angiogenesis protein find use in in situ imaging 
techniques. In this method cells are contacted with from one to many antibodies to the angiogenesis 
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protein(s). Following washing to remove non-specific antibody binding, the presence of the antibody 
or antibodies is detected. In one embodiment the antibody is detected by incubating with a secondary 
antibody that contains a detectable label. In another method the primary antibody to the angiogenesis 
protein(s) contains a detectable label. In another preferred embodiment each one of multiple primary 
5 antibodies contains a distinct and detectable label. This method finds particular use in simultaneous 
screening for a plurality of angiogenesis proteins. As will be appreciated by one of ordinary skill In the 
art, numerous other histological Imaging techniques are useful in the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability to detect and 
distinguish emissions of different wavelengths. In addition, a fluorescence activated cell sorter (FACS) 

10 can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing angiogenesis from blood samples. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. Blood 
samples, therefore, are useful as samples to be probed or tested for the presence of secreted 
angiogenesis proteins. Antibodies can be used to detect the angiogenesis by any of the previously 
15 described immunoassay techniques including ELISA, immunoblotting (Westem blotting), 

immunoprecipitation, BIACORE technology and the like, as will be appreciated by one of ordinary skill 
in the art. 

In a preferred embodiment, in situ hybridization of labeled angiogenesis nucleic acid probes to tissue 
arrays is done. For example, arrays of tissue samples, including angiogenesis tissue and/or normal 
2 0 tissue, are made. In situ hybridization as is known in the art can then be done. 

It is understood that when comparing the fingerprints between an individual and a standard, the skilled 
artisan can make a diagnosis as well as a prognosis. It is further understood that the genes which 
indicate the diagnosis may differ from those which indicate the prognosis. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic acids, modified proteins and 
25 cells containing angiogenesis sequences are used in prognosis assays. As above, gene expression 
profiles can be generated that con-elate to angiogenesis severity, in terms of long term prognosis. 
Again, this may be done on either a protein or gene level, with the use of genes being preferred. As 
above, the angiogenesis probes are attached to biochips for the detection and quantification of 
angiogenesis sequences in a tissue or patient. The assays proceed as outlined above for diagnosis. 
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In a preferred embodiment any of the three classes of proteins as described herein are used in drug 
screening assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in drug screening assays or by evaluating the effect of 
drug candidates on a "gene expression profile" or expression profile of polypeptides. In a prefen-ed 
5 embodiment, the expression profiles are used, preferably in conjunction with high throughput 

screening techniques to allow monitoring for expression profile genes after treatment with a candidate 
agent, ZIokamik, et al., Science 279, 84-8 (1998), Heid, 1996 #69. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic acids, modified proteins and 
cells containing the native or modified angiogenesis proteins are used in screening assays. That is, 
1 0 the present invention provides novel methods for screening for compositions which modulate the 

angiogenesis phenotype. As above, this can be done on an individual gene level or by evaluating the 
effect of drug candidates on a "gene expression profile". In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent, see ZIokamik, supra. 

1 5 Having identified the differentially expressed genes herein, a variety of assays may be executed. In a 
preferred embodiment, assays may be run on an individual gene or protein level. That is, having 
identified a particular gene as up regulated in angiogenesis, candidate bioactive agents may be 
screened to modulate this gene's response; preferably to down regulate the gene, although in some 
circumstances to up regulate the gene. "Modulation" thus includes both an increase and a decrease 

2 0 in gene expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing angiogenesis, with changes of at least 10%, 
preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or greater. Thus, 
if a gene exhibits a 4 fold increase in angiogenic tissue compared to normal tissue, a decrease of 
about four fold is desired; a 10 fold decrease in angiogenic tissue compared to normal tissue gives a 

25 10 fold increase in expression for a candidate agent being desired. 

As will be appreciated by those in the art, this may be done by evaluation at either the gene or the 
protein level; that is, the amount of gene expression may be monitored using nucleic acid probes and 
the quantification of gene expression levels, or, alternatively, the gene product itself can be monitored, 
for example through the use of antibodies to the angiogenesis protein and standard immunoassays. 

30 In a prefen^d embodiment, gene expression monitoring is done and a number of genes, i.e. an 

expression profile, is monitored simultaneously, although multiple protein expression monitoring can 
be done as well. 
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In this embodiment, tlie angiogenesis nucleic acid probes are attached to biochips as outlined herein 
for the detection and quantification of angiogenesis sequences in a particular cell. The assays are 
further described below. 

Generally, in a preferred embodiment, a candidate bioactive agent is added to the cells prior to 
5 analysis. Moreover, screens are provided to identify a candidate bioactive agent which modulates 
angiogenesis, modulates angiogenesis proteins, binds to an angiogenesis protein, or interferes 
between the binding of an angiogenesis protein and an antibody. 

The term "candidate bioactive agent" or "drug candidate" or grammatical equivalents as used herein 
describes any molecule, e.g., protein, oligopeptide, small organic molecule, polysaccharide, 

10 polynucleotide, etc., to be tested for bioactive agents that are capable of directly or indirectly altering 
either the angiogenesis phenotype or the expression of an angiogenesis sequence, Including both 
nucleic acid sequences and protein sequences. In preferred embodiments, the bioactive agents 
modulate the expression profiles, or expression profile nucleic acids or proteins provided herein. In a 
particularly preferred embodiment, the candidate agent suppresses an angiogenesis phenotype, for 

15 example to a normal tissue fingerprint. Similarly, the candidate agent preferably suppresses a severe 
angiogenesis phenotype. Generally a plurality of assay mixtures are run in parallel with different agent 
concentrations to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of detection. 

In one aspect, a candidate agent will neutralize the effect of an angiogenesis protein. By "neutralize" 

2 0 is meant that activity of a protein is either inhibited or counter acted against so as to have substantially 

no effect on a cell. 

Candidate agents encompass numerous chemical classes, though typically they are organic 
molecules, preferably small organic compounds having a molecular weight of more than 1 00 and less 
than about 2,500 daltons. Preferred small molecules are less than 2000, or less than 1500 or less 
25 than 1000 or less than 500 D. Candidate agents comprise functional groups necessary for structural 
interaction with proteins, particulariy hydrogen bonding, and typically include at least an amine, 
carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The 
candidate agents often comprise cyclical carijon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. Candidate 

3 0 agents are also found among biomolecules including peptides, saccharides, fatty acids, steroids, 

purines, pyrimidines, derivatives, structural analogs or combinations thereof. Particulariy preferred are 
peptides. 
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Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural 
compounds. For example, numerous means are available for random and directed synthesis of a 
wide variety of organic compounds and biomolecules, including expression of randomized 
oligonucleotides. Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant 
5 and animal extracts are^ available or readily produced. Additionally, natural or synthetically produced 
libraries and compounds are readily modified through conventional chemical, physical and 
biochemical means. Known pharmacological agents may be subjected to directed or random 
chemical modifications, such as acylation, alkylation, esterification, amidification to produce structural 
analogs. 

10 In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is meant 
at least two covalently attached amino acids, which includes proteins, polypeptides, oligopeptides and 
peptides. The protein may be made up of naturally occurring amino acids and peptide bonds, or 
synthetic peptidomimetic structures. Thus "amino acid", or "peptide residue", as used herein means 
both naturally occurring and synthetic amino acids. For example, homo-phenylalanine, citrulline and 

15 noreleucine are considered amino acids for the purposes of the invention. "Amino acid" also includes 
imino acid residues such as proline and hydroxyproline. The side chains may be in either the (R) or 
the (S) configuration. In the preferred embodiment, the amino acids are in the (S) or L-configuration. 
If non-naturally occurring side chains are used, non-amino acid substituents may be used, for example 
to prevent or retard in vivo degradations. 

20 In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or 

fragments of naturally occun-ing proteins. Thus, for example, cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way libraries of 
procaryotic and eucaryotic proteins may be made for screening in the methods of the invention. 
Particulariy preferred in this embodiment are libraries of bacterial, fungal, virai, and mammalian 

25 proteins, with the latter being preferred, and human proteins being especially preferred. 

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to about 30 
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to about 15 
being particulariy preferred. The peptides may be digests of naturally occun-ing proteins as is outlined 
above, random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents 
3 0 herein is meant that each nucleic acid and peptide consists of essentially random nucleotides and 

amino acids, respectively. Since generally these random peptides (or nucleic acids, discussed below) 
are chemically synthesized, they may incorporate any nucleotide or amino acid at any position. The 
synthetic process can be designed to generate randomized proteins or nucleic acids, to allow the 
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formation of all or most of the possible combinations over the length of the sequence, thus forming a 
library of randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or constants at any 
position. In a preferred embodiment, the library is biased. That is, some positions within the 
5 sequence are either held constant, or are selected from a limited number of possibilities. For 

example, in a prefen^d embodiment, the nucleotides or amino acid residues are randomized within a 
defined class, for example, of hydrophobic amino acids, hydrophilic residues, sterically biased (either 
small or large) residues, towards the creation of nucleic acid binding domains, the creation of 
cysteines, for cross-linking, prolines for SH-3 domains, serines, threonines, tyrosines or histidines for 
10 phosphorylation sites, etc., or to purines, etc. 

In a preferred embodiment, the candidate bioactive agents are nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid candidate bioactive agents may be naturally 
occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, digests 
of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

15 In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a wide 
variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some period of time, 
the sample containing the target sequences to be analyzed is added to the biochip. If required, the 
target sequence is prepared using known techniques. For example, the sample may be treated to 
2 0 lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or amplification 

such as PGR occurring as needed, as will be appreciated by those in the art. For example, an in vitro 
transcription with labels covalently attached to the nucleosides is done. Generally, the nucleic acids 
are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a fluorescent, a 
2 5 chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the target 
sequence's specific binding to a probe. The label also can be an enzyme, such as. alkaline 
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate produces 
a product that can be detected. Altematively, the label can be a labeled compound or small molecule, 
such as an enzyme inhibitor, that binds but is not catalyzed or altered by the enzyme. The label also 
30 can be a moiety or compound, such as, an epitope tag or biotin which specifically binds to streptavidin. 
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For the example of biotin, the streptavidin is labeled as described above, thereby, providing a 
detectable signal for the bound target sequence. As known in the art, unbound labeled streptavidin is 
removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization assays or can 
5 comprise "sandwich assays", which include the use of multiple probes, as is generally outlined in U.S. 
Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584. 5,571,670, 5,580,731, 5,571,670, 
5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are 
hereby incorporated by reference. In this embodiment, in general, the target nucleic acid is prepared 
as outlined above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
1 0 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including high, moderate 
and low stringency conditions as outlined above. The assays are generally run under stringency 
conditions which allows formation of the label probe hybridization complex only in the presence of 
target. Stringency can be controlled by altering a step parameter that is a thermodynamic variable, 
1 5 including, but not limited to, temperature, formamide concentration, salt concentration, chaotropic salt 
concentration pH, organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally outlined in U.S. 
Patent No. 5,681 ,697. Thus it may be desirable to perform certain steps at higher stringency 
conditions to reduce non-specific binding. 

2 0 The reactions outlined herein may be accomplished in a variety of ways, as will be appreciated by 
those in the art. Components of the reaction may be added simultaneously, or sequentially, in any 
order, with prefen-ed embodiments outlined below. In addition, the reaction may include a variety of 
other reagents may be included in the assays. These include reagents like salts, buffers, neutral 
proteins, e.g. albumin, detergents, etc which may be used to facilitate optimal hybridization and 

25 detection, and/or reduce non-specific or background interactions. Also reagents that otherwise 

improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial 
agents, etc., may be used, depending on the sample preparation methods and purity of the target. 

Once the assay is run, the data is analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 
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The screens are done to identify drugs or bioactive agents tliat modulate tine angiogenesis plienotype. 
Specifically, there are several types of screens that can be run. A preferred embodiment is in the 
screening of candidate agents that can induce or suppress a particular expression profile, thus 
preferably generating the associated phenotype. That is, candidate agents that can mimic or produce 
5 an expression profile in angiogenesis similar to the expression profile of normal tissue is expected to 
result in a suppression of the angiogenesis phenotype. Thus, in this embodiment, mimicking an 
expression profile, or changing one profile to another, is the goal. 

In a preferred embodiment, as for the diagnosis applications, having identified the differentially 
expressed genes important in any one state, screens can be run to alter the expression of the genes 
1 0 individually. That is, screening for modulation of regulation of expression of a single gene can be 
done; that is, rather than try to mimic all or part of an expression profile, screening for regulation of 
individual genes can be done. Thus, for example, particularly in the case of target genes whose 
presence or absence is unique between two states, screening is done for modulators of the target 
gene expression. 

15 In a preferred embodiment, screening is done to alter the biological function of the expression product 
of the differentially expressed gene. Again, having identified the importance of a gene in a particular 
state, screening for agents that bind and/or modulate the biological activity of the gene product can he 
run as is more fully outlined below. 

Thus, screening of candidate agents that modulate the angiogenesis phenotype either at the gene 

2 0 expression level or the protein level can be done. 

In addition screens can be done for novel genes that are induced in response to a candidate agent. 
After identifying a candidate agent based upon its ability to suppress an angiogenesis expression 
pattern leading to a normal expression pattern, or modulate a single angiogenesis gene expression 
profile so as to mimic the expression of the gene from normal tissue, a screen as described above can 
25 be performed to identify genes that are specifically modulated in response to the agent. Comparing 
expression profiles between normal tissue and agent treated angiogenesis tissue reveals genes that 
are not expressed in normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. 
These agent specific sequences can be identified and used by any of the methods described herein 
for angiogenesis genes or proteins. In particular these sequences and the proteins they encode find 

3 0 use in marking or identifying agent treated cells. In addition, antibodies can be raised against the 

agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 
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Thus, in one embodiment, a candidate agent Is administered to a population of angiogenic cells, that 
thus has an associated angiogenesis expression profile. By "administration" or "contacting" herein Is 
meant that the candidate agent Is added to the cells In such a manner as to allow the agent to act 
upon the cell, whether by uptake and intracellular action, or by action at the cell surface. In some 
5 embodiments, nucleic acid encoding a proteinaceous candidate agent (i.e. a peptide) may be put Into 
a viral construct such as a retroviral construct and added to the cell, such that expression of the 
peptide agent Is accomplished; see PCT US97/01019, hereby expressly Incorporated by reference. 

Once the candidate agent has been administered to the cells, the cells can be washed if desired and 
are allowed to incubate under preferably physiological conditions for some period of time. The cells 
10 are then harvested and a new gene expression profile is generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that reduce or suppress the 
angiogenesis phenotype. A change In at least one gene of the expression profile indicates that the 
agent has an effect on angiogenesis activity. By defining such a signature for the angiogenesis 
phenotype, screens for new drugs that alter the phenotype can be devised. With this approach, the 
1 5 drug target need not be known and need not be represented In the original expression screening 
platfomi, nor does the level of transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on individual genes and gene 
products (proteins). That is. having identified a particular differentially expressed gene as Important In 
a particular state, screening of modulators of either the expression of the gene or the gene product 
2 0 Itself can be done. The gene products of differentially expressed genes are sometimes referred to 

herein as "angiogenesis proteins". In preferred embodiments the angiogenesis protein is as depicted 
in Figures 4, 8, 13, 18, and 22 or encoded by the sequences shown in figures 2, 3, 7, 12, 17, 21 and 
23. The angiogenesis protein may be a fragment, or altematlvely, be the full length protein to a 
fragment shown herein. 

2 5 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 amino acids long. More 
preferably the fragment is a soluble fragment. 

In a prefen-ed embodiment, the fragment Is from AAA1. Preferably, the fragment includes a non- 
transmembrane region. In a preferred embodiment, the AAA1 fragment has an N-termlnal Cys to aid 
in solubility. Preferably, the fragment is selected from AAAIpl and AAA1p2. 
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In a preferred embodiment, the fragment is cliarged and from the c-terminus of AAA4. In one 
embodiment, the c-terminus of the fragment is kept as a free acid and the n-terminus Is a free amine 
to aid in coupling, i.e., to cysteine. In one embodiment the fragment is an internal peptide overlapping 
hydrophilic stretch of AAA4. In a preferred embodiment, the termini is blocked. Preferably, the 
5 fragment of AAA4 is selected from AAA4p1 or AAA4p2. In another preferred embodiment, the 
fragment is a novel fragment from the N-terminal. In one embodiment, the fragment excludes 
sequence outside of the N-terminal, in another embodiment, the fragment includes at least a portion of 
the N-termlnal. "N-terminal" is used interchangeably herein with "N-terminus" which is further 
described above. 

10 In one embodiment the angiogenesis proteins are conjugated to an immunogenic agent as discussed 
herein. In one embodiment the angiogenesis protein is conjugated to BSA. 

Thus, in a prefened embodiment, screening for modulators of expression of specific genes can be 
done. This will be done as outlined above, but in general the expression of only one or a few genes 
are evaluated. 

15 In a preferred embodiment, screens are designed to first find candidate agents that can bind to 
differentially expressed proteins, and then these agents may be used in assays that evaluate the 
ability of the candidate agent to modulate differentially expressed activity. Thus, as will be appreciated 
by those in the art, there are a number of different assays which may be run; binding assays and 
activity assays. 

2 0 In a preferred embodiment, binding assays are done. In general, purified or isolated gene product is 
used; that is, the gene products of one or more differentially expressed nucleic acids are made. In 
general, this is done as is known in the art. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Altematively, cells comprising the angiogenesis proteins can be used in the assays. 

2 5 Thus, in a preferred embodiment, the methods comprise combining an angiogenesis protein and a 
candidate bioactive agent, and determining the binding of the candidate agent to the angiogenesis 
protein. Preferred embodiments utilize the human angiogenesis protein, although other mammalian 
proteins may also be used, for example for the development of animal models of human disease. In 
some embodiments, as outlined herein, variant or derivative angiogenesis proteins may be used. 
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Generally, in a preferred embodiment of the methods herein, the angiogenesis protein or the 
candidate agent is non-diffusably bound to an insoluble support having isolated sample receiving 
areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be made of any composition 
to which the compositions can be bound, is readily separated from soluble material, and is otherwise 
5 compatible with the overall method of screening. The surface of such supports may be solid or porous 
and of any convenient shape. Examples of suitable insoluble supports include microtiter plates, 
arrays, membranes and beads. These are typically made of glass, plastic (e.g., polystyrene), 
polysaccharides, nylon or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially 
convenient because a large number of assays can be carried out simultaneously, using small amounts 

10 of reagents and samples. The particular manner of binding of the composition is not crucial so long 
as it is compatible with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies (which 
do not sterically block either the ligand binding site or activation sequence when the protein is bound to 
the support), direct binding to "sticl<y" or ionic supports, chemical crosslinking, the synthesis of the 

1 5 protein or agent on the surface, etc. Following binding of the protein or agent, excess unbound 

material is removed by washing. The sample receiving areas may then be blocked through incubation 
with bovine serum albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, and a candidate 
bioactive agent is added to the assay. Alternatively, the candidate agent is bound to the support and 

2 0 the angiogenesis protein is added. Novel binding agents include specific antibodies, non-natural 

binding agents identified in screens of chemical libraries, peptide analogs, etc. Of particular interest 
are screening assays for agents that have a low toxicity for human cells. A wide variety of assays may 
be used for this purpose, including labeled in vitro protein-protein binding assays, electrophoretic 
mobility shift assays, immunoassays for protein binding, functional assays (phosphorylation assays, 
25 etc.) and the like. 

The determination of the binding of the candidate bioactive agent to the angiogenesis protein may be 
done in a number of ways. In a preferred embodiment, the candidate bioactive agent is labelled, and 
binding determined directly. For example, this may be done by attaching all or a portion of the 
angiogenesis protein to a solid support, adding a labelled candidate agent (for example a fluorescent 

3 0 label), washing off excess reagent, and determining whether the label is present on the solid support. 

Various blocking and washing steps may be utilized as is known in the art. 

By "labeled" herein is meant that the compound is either directly or indirectly labeled with a label which 
provides a detectable signal, e.g. radioisotope, fluorescers, enzyme, antibodies, particles such as 
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magnetic particles, chemiiuminescers, or specific binding molecules, etc. Specific binding molecules 
include pairs, sucli as biotin and streptavidin, digoxin and antidigoxin etc. For the specific binding 
members, tlie complementary member would normally be labeled witti a molecule whicli provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or indirectly 
5 provide a detectable signal. 

In some embodiments, only one of the components is labeled. For example, the proteins (or 
proteinaceous candidate agents) may be labeled at tyrosine positions using '^^1, or with fluorophores. 
Alternatively, more than one component may be labeled with different labels; using ^''^l for the proteins, 
for example, and a fluorophor for the candidate agents. 

10 In a preferred embodiment, the binding of the candidate bioactive agent is determined through the use 
of competitive binding assays. In this embodiment, the competitor is a binding moiety l<nown to bind 
to the target molecule (i.e. angiogenesis), such as an antibody, peptide, binding partner, ligand, etc. 
Under certain circumstances, there may be competitive binding as between the bioactive agent and 
the binding moiety, with the binding moiety displacing the bioactive agent. 

15 In one embodiment, the candidate bioactive agent is labeled. Either the candidate bioactive agent, or 
the competitor, or both, is added first to the protein for a time sufficient to allow binding, if present. 
Incubations may be performed at any temperature which facilitates optimal activity, typically between 4 
and 40°C. Incubation periods are selected for optimum activity, but may also be optimized to facilitate 
rapid high through put screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent 

20 is generally removed or washed away. The second component is then added, and the presence or 
absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the candidate bioactive agent. 
Displacement of the competitor is an indication that the candidate bioactive agent is binding to the 
angiogenesis protein and thus is capable of binding to, and potentially modulating, the activity of the 
25 angiogenesis protein. In this embodiment, either component can be labeled. Thus, for example, if the 
competitor is labeled, the presence of label in the wash solution indicates displacement by the agent. 
Alternatively, if the candidate bioactive agent is labeled, the presence of the label on the support 
indicates displacement. 

In an alternative embodiment, the candidate bioactive agent is added first, with incubation and 
3 0 washing, followed by the competitor. The absence of binding by the competitor may indicate that the 
bioactive agent is bound to the angiogenesis protein with a higher affinity. Thus, if the candidate 
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bioactive agent is labeled, the presence of the label on the support, coupled with a lacl< of competitor 
binding, may indicate that the candidate agent is capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise differential screening to identity bioactive agents 
that are capable of modulating the activitity of the angiogenesis proteins. In this embodiment, the 
5 methods comprise combining an angiogenesis protein and a competitor in a first sample. A second 
sample comprises a candidate bioactive agent, an angiogenesis protein and a competitor. The 
binding of the competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding to the angiogenesis 
protein and potentially modulating its activity. That is. if the binding of the competitor is different in the 
10 second sample relative to the first sample, the agent is capable of binding to the angiogenesis protein. 

Alternatively, a preferred embodiment utilizes differential screening to identify drug candidates that 
bind to the native angiogenesis protein, but cannot bind to modified angiogenesis pnateins. The 
structure of the angiogenesis protein may be modeled, and used in rational drug design to synthesize 
agents that interact with that site. Drug candidates that affect angiogenesis bioactivity are also 
15 identified by screening drugs for the ability to either enhance or reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably all control and test 
samples are performed in at least triplicate to obtain statistically significant results. Incubation of all 
samples is for a time sufficient for the binding of the agent to the protein. Following incubation, all 
samples are washed free of non-specifically bound material and the amount of bound, generally 
2 0 labeled agent determined. For example, where a radiolabel is employed, the samples may be 
counted in a scintillation counter to determine the amount of bound compound. 

A variety of other reagents may be included in the screening assays. These include reagents like 
salts, neutral proteins, e.g. albumin, detergents, etc which may be used to facilitate optimal 
protein-protein binding and/or reduce non-specific or background interactions. Also reagents that 
25 otherwise improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, 

anti-microbial agents, etc., may be used. The mixture of components may be added in any order that 
provides for the requisite binding. 

Screening for agents that modulate the activity of angiogenesis proteins may also be done. In a 
preferred embodiment, methods for screening for a bioactive agent capable of modulating the activity 
30 of angiogenesis proteins comprise the steps of adding a candidate bioactive agent to a sample of 
angiogenesis proteins, as above, and determining an alteration in the biological activity of 
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angiogenesis proteins. "Modulating tine activity of angiogenesis proteins" includes an increase in 
activity, a decrease in activity, or a change in the type or kind of activity present. Thus, in this 
embodiment, the candidate agent should both bind to angiogenesis proteins(although this may not be 
necessary), and alter its biological or biochemical activity as defined herein. The methods include 
5 both in vitro screening methods, as are generally outlined above, and in vivo screening of cells for 
alterations in the presence, distribution, activity or amount of angiogenesis proteins. 

Thus, in this embodiment, the methods comprise combining an angiogenesis sample and a candidate 
bioactive agent, and evaluating the effect on angiogenesis. By "angiogenesis activity" or grammatical 
equivalents herein is meant one of angiogenesis's biological activities, including, but not limited to, its 
1 0 role in angiogenesis. In one embodiment, angiogenesis activity includes activation of AAA4, AAA1 , 
Edg-1, alpha 5 beta1 integrin, endomucin and matrix metalloproteinase 10. An inhibitor of 
angiogenesis activity is the inhibition of any one or more angiogenesis activities. 

In a preferred embodiment, the activity of the angiogenesis protein is increased; in another preferred 
embodiment, the activity of the angiogenesis protein is decreased. Thus, bioactive agents that are 
1 5 antagonists are prefen-ed in some embodiments, and bioactive agents that are agonists may be 
preferred in other embodiments. 

In a preferred embodiment, the invention provides methods for screening for bioactive agents capable 
of modulating the activity of an angiogenesis protein. The methods comprise adding a candidate 
bioactive agent, as defined above, to a cell comprising angiogenesis proteins. Preferred cell types 
2 0 include almost any cell. The cells contain a recombinant nucleic acid that encodes an angiogenesis 
protein. In a preferred embodiment, a library of candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or subsequent 
exposure of physiological signals, for example hormones, antibodies, peptides, antigens, cytokines, 
grow/th factors, action potentials, pharmacological agents including chemotherapeutics, radiation, 

2 5 carcinogenics. or other cells (i.e. cell-cell contacts). In another example, the determinations are 

determined at different stages of the cell cycle process. 

In this way, bioactive agents are identified. Compounds with pharmacological activity are able to 
enhance or interfere with the activity of the angiogenesis protein. In one embodiment, "angiogenesis 
protein activity" as used herein includes at least one of the following: angiogenesis protein activity as 

3 0 defined herein, binding to Edg-1 , activation of Edg-1 , or activation of substrates of Edg-1 . In one 

embodiment, angiogenesis activity is defined as the unregulated proliferation of angiogenic tissue, or 
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the growth of arteries in tissue. In one aspect, angiogenesis activity as defined herein is related to the 
activity of Edg-1 in the upregulation of Edg-1 in angiogenic tissue. 

In another embodiment, angiogenesis protein activity includes at least one of the following: 
angiogenesis activity, binding to one of AAA4, AAA1, Edg-1, alpha 5 beta 1 Integrin, endomucin, 
5 matrix metalloproteinase 10, or activation of substrates of AAA4, AAA1 , Edg-1 , alpha 5 beta 1 integrin, 
endomucin, matrix metalloproteinase 10, respectively. In one prefen-ed embodiment, AAA1 comprises 
its N-terminal end. In one aspect, angiogenesis activity as defined herein is related to the activity of 
AAA4, AAA1, Edg-1, alpha 5 beta 1 integrin, endomucin, matrix metalloproteinase 10, in the 
upregulation of AAA4, AAA1, Edg-1, alpha 5 beta 1 integrin, endomucin, matrix metalloproteinase 10, 
1 0 respectively in angiogenesis tissue. 

In one embodiment, a method of inhibiting angiogenic cell division is provided. The method comprises 
administration of a angiogenesis Inhibitor. 

In another embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
administration of an angiogenesis inhibitor. 

15 In a further embodiment, methods of treating cells or individuals with angiogenesis are provided. The 
method comprises administration of an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed above. In another 
embodiment, the angiogenesis inhibitor is an antisense molecule. Antisense molecules as used 
herein include antisense or sense oligonucleotides comprising a singe-stranded nucleic acid sequence 
2 0 (either RNA or DNA) capable of binding to target mRNA (sense) or DNA (antisense) sequences for 
angiogenesis molecules. A preferred antisense molecule is for AAA4, AAA1 , Edg-1 , alpha 5 beta 1 
integrin, endomucin, or matrix metalloproteinase 10, more preferable the angiogenesis sequences in 
Table 5, or for a ligand or activator thereof. A most preferred antisense molecule is for Edg-1 or for a 
ligand or activator thereof. Antisense or sense oligonucleotides, according to the present invention, 

2 5 comprise a fragment generally at least about 14 nucleotides, preferably from about 14 to 30 

nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a cDNA 
sequence encoding a given protein is described in, for example. Stein and Cohen (Cancer Res. 
48:2659, 1988) and van der Krol et al. (BioTechnigues 6:958. 1988). 

Antisense molecules may be introduced into a cell containing the target nucleotide sequence by 

3 0 formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
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ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other 
cytokines, or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand 
binding molecule does not substantially interfere with the ability of the ligand binding molecule to bind 
to its con-esponding molecule or receptor, or block entry of the sense or antisense oligonucleotide or 
5 its conjugated version into the cell. Alternatively, a sense or an antisense oligonucleotide may be 

introduced into a cell containing the target nucleic acid sequence by formation of an oligonucleotide- 
lipid complex, as described in WO 90/10448. It is understood that the use of antisense molecules or 
knock out and knock in models may also be used in screening assays as discussed above, in addition 
to methods of treatment. 

10 The compounds having the desired pharmacological activity may be administered in a physiologically 
acceptable carrier to a host, as previously described. The agents may be administered in a variety of 
ways, orally, parenterally e.g., subcutaneously, intraperitoneally, intravasculariy, etc. Depending upon 
the manner of introduction, the compounds may be formulated in a variety of ways. The concentration 
of therapeutically active compound in the formulation may vary from about 0.1-100 wt.%. The agents 

1 5 may be administered alone or in combination with other treatments, i.e., radiation. 

The pharmaceutical compositions can be prepared in various forms, such as granules, tablets, pills, 
suppositories, capsules, suspensions, salves, lotions and the like. Pharmaceutical grade organic or 
inorganic carriers and/or diluents suitable for oral and topical use can be used to make up 
compositions containing the therapeutically-active compounds. Diluents known to the art include 
20 aqueous media, vegetable and animal oils and fats. Stabilizing agents, wetting and emulsifying 

agents, salts for varying the osmotic pressure or buffers for securing an adequate pH value, and skin 
penetration enhancers can be used as auxiliary agents. 

Without being bound by theory, it appears that the various angiogenesis sequences are important in 
angiogenesis. Accordingly, disorders based on mutant or variant angiogenesis genes may be 

2 5 determined. In one embodiment, the invention provides methods for identifying cells containing 

variant angiogenesis genes comprising determining all or part of the sequence of at least one 
endogeneous angiogenesis genes in a cell. As will be appreciated by those in the art, this may be 
done using any number of sequencing techniques. In a preferred embodiment, the invention provides 
methods of identifying the angiogenesis genotype of an individual comprising determining all or part of 

3 0 the sequence of at least one angiogenesis gene of the individual. This is generally done in at least 

one tissue of the individual, and may include the evaluation of a number of tissues or different samples 
of the same tissue. The method may include comparing the sequence of the sequenced angiogenesis 
gene to a known angiogenesis gene, i.e. a wild-type gene. 
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The sequence of all or part of the angiogenesis gene can then be compared to the sequence of a 
known angiogenesis gene to determine if any differences exist. This can be done using any number 
of known homology programs, such as Bestfit, etc. In a preferred embodiment, the presence of a a 
difference in the sequence between the angiogenesis gene of the patient and the known angiogenesis 
5 gene is indicative of a disease state or a propensity for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to determine the number of 
copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis genes are used as probes to determine the 
chromosomal localization of the angiogenesis genes. Information such as chromosomal localization 
1 0 finds use in providing a diagnosis or prognosis in particular when chromosomal abnormalities such as 
translocations, and the like are identified in the angiogenesis gene locus. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or organisms are provided. In 
one embodiment, the methods comprise administering to a cell an anti-angiogenesis antibody that 
reduces or eliminates the biological activity of an endogeneous angiogenesis protein. Alternatively. 

1 5 the methods comprise administering to a cell or organism a recombinant nucleic acid encoding an 
angiogenesis protein. As will be appreciated by those in the art, this may be accomplished in any 
number of ways. In a preferred embodiment, for example when the angiogenesis sequence is down- 
regulated in angiogenesis, the activity of the angiogenesis gene is increased by increasing the amount 
of angiogenesis in the cell, for example by overexpressing the endogeneous angiogenesis or by 

2 0 administering a gene encoding the angiogenesis sequence, using known gene-therapy techniques, for 
example. In a preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described in 
PCT/US93/03868, hereby incorporated by reference in its entireity. Alternatively, for example when 
the angiogenesis sequence is up-regulated in angiogenesis, the activity of the endogeneous 

2 5 angiogenesis gene is decreased, for example by the administration of a angiogenesis antisense 

nucleic acid. 

In one embodiment, the angiogenesis proteins of the present invention may be used to generate 
polyclonal and monoclonal antibodies to angiogenesis proteins, which are useful as described herein. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 

3 0 chromatography columns. These columns may then be used to purify angiogenesis antibodies. In a 

preferred embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. These antibodies find use in a 
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number of applications. For example, the angiogenesis antibodies may be coupled to standard affinity 
chromatography columns and used to purify angiogenesis proteins. The antibodies may also be used 
as blocking polypeptides, as outlined above, since they will specifically bind to the angiogenesis 
protein. 

5 In one embodiment, a therapeutically effective dose of an angiogenesis proteins and modulator 

thereof is administered to a patient. By "therapeutically effective dose" herein is meant a dose that 
produces the effects for which it is administered. The exact dose will depend on the purpose of the 

treatment, and will be ascertainable by one skilled in the art using known techniques. As is known in 
the art, adjustments for angiogenesis degradation, systemic versus localized delivery, and rate of new 
10 protease synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable with 
routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other animals, 
particulariy mammals, and organisms. Thus the methods are applicable to both human therapy and 
15 veterinary applications. In the preferred embodiment the patient is a mammal, and in the most 
preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the present invention can 
be done in a variety of ways as discussed above, including, but not limited to, orally, subcutaneously, 
intravenously, intranasally, transdermally, intraperitoneally, intramusculariy, intrapulmonary, vaginally, 
2 0 rectally, or intraocularly. In some instances, for example, in the treatment of wounds and 

inflammation, the angiogenesis proteins and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise an angiogenesis protein in a form 
suitable for administration to a patient. In the preferred embodiment, the pharmaceutical compositions 
are in a water soluble form, such as being present as pharmaceutically acceptable salts, which is 

2 5 meant to include both acid and base addition salts. "Pharmaceutically acceptable acid addition salt" 

refers to those salts that retain the biological effectiveness of the free bases and that are not 
biologically or otherwise undesirable, formed with inorganic acids such as hydrochloric acid, 
hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as 
acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 

3 0 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases such as 
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sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, manganese, 
aluminum salts and the like. Particularly preferred are the ammonium, potassium, sodium, calcium, 
and magnesium salts. Salts derived from pharmaceutically acceptable organic non-toxic bases 
include salts of primary, secondary, and tertiary amines, substituted amines including naturally 
occurring substituted amines, cyclic amines and basic ion exchange resins, such as isopropylamine, 
trimethylamine, diethylamine, triethylamine, tripropylamine, and ethanolamine. 

The phamiaceutical compositions may also include one or more of the following: carrier proteins such 
as serum albumin; buffers: fillers such as microcrystalline cellulose, lactose, corn and other starches; 
binding agents; sweeteners and other flavoring agents; coloring agents; and polyethylene glycol. 

Additives are well known in the art, and are used in a variety of formulations. 

In a preferred embodiment, angiogenesis proteins and modulators are administered as therapeutic 
agents, and can be formulated as outlined above. Similarly, angiogenesis genes (including both the 
full-length sequence, partial sequences, or regulatory sequences of the angiogenesis coding regions) 
can be administered in gene therapy applications, as is known in the art. These angiogenesis genes 
can include antisense applications, either as gene therapy (i.e. for incorporation into the genome) or 
as antisense compositions, as will be appreciated by those in the art. 

In a preferred embodiment, angiogenesis genes are administered as DNA vaccines, either single 
genes or combinations of angiogenesis genes. Naked DNA vaccines are generally known in the art. 
Brower, Nature Biotechnology, 16:1304-1305 (1998). 

In one embodiment, angiogenesis genes of the present invention are used as DNA vaccines. 
Methods for the use of genes as DNA vaccines are well known to one of ordinary skill in the art, and 
include placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 
promoter for expression in an angiogenesis patient. The angiogenesis gene used for DNA vaccines 
can encode full-length angiogenesis proteins, but more preferably encodes portions of the 
angiogenesis proteins including peptides derived from the angiogenesis protein. In a preferred 
embodiment a patient is immunized with a DNA vaccine comprising a plurality of nucleotide 
sequences derived from an angiogenesis gene. Similarly, it is possible to immunize a patient with a 
plurality of angiogenesis genes or portions thereof as defined herein. Without being bound by theory, 
expression of the polypeptide encoded by the DNA vaccine, cytotoxic T-cells, helper T-cells and 
antibodies are induced which recognize and destroy or eliminate cells expressing angiogenesis 
proteins. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant molecule with the 
DNA vaccine. Such adjuvant molecules include cytokines that increase the immunogenic response to 
the angiogenesis polypeptide encoded by the DNA vaccine. Additional or alternative adjuvants are 
known to those of ordinary skill in the art and find use in the invention. 

In another prefen-ed embodiment angiogenesis genes find use in generating animal models of 
angiogenesis. As is appreciated by one of ordinary skill in the art, when the angiogenesis gene 
identified is repressed or diminished in angiogenesis tissue, gene therapy technology wherein 
antisense RNA directed to the angiogenesis gene will also diminish or repress expression of the gene. 
An animal generated as such serves as an animal model of angiogenesis that finds use in screening 
bioactive drug candidates. Similarly, gene knockout technology, for example as a result of 
homologous recombination with an appropriate gene targeting vector, will result in the absence of the 
angiogenesis protein. When desired, tissue-specific expression or knockout of the angiogenesis 
protein may be necessary. 

It is also possible that the angiogenesis protein is overexpressed in angiogenesis. As such, 
transgenic animals can be generated that overexpress the angiogenesis protein. Depending on the 
desired expression level, promoters of various strengths can be employed to express the transgene. 
Also, the number of copies of the integrated transgene can be determined and compared for a 
determination of the expression level of the transgene. Animals generated by such methods find use 
as animal models of angiogenesis and are additionally useful in screening for bioactive molecules to 
treat angiogenesis. 

It is understood that the examples described above In no way serve to limit the true scope of this 
invention, but rather are presented for illustrative purposes. All references and sequences of 
accession numbers cited herein are incorporated by reference in their entirety. 

EXAMPLES 
Example 1 

Tissue Preparation. Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Estimate tissue weight Homogenize tissue samples in 1ml of TRIzol per 50mg of tissue using a 
Polytron 3100 homogenizer. The generator/probe used depends upon the tissue size. A 
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generator that is too large for the amount of tissue to be homogenized will cause a loss of sample 
and lower RNA yield. Use the 20mm generator for tissue weighing more than 0.6g. If the working 
volume is greater than 2ml, then homogenize tissue in a 15ml polypropylene tube (Falcon 2059). 
Fill tube no greater than 10ml. 

5 HQMOGENIZATION 

Before using generator, it should have been cleaned after last usage by running it through soapy 
H20 and rinsing thoroughly. Run through with EtOH to sterilize. Keep tissue frozen until ready. 
Add TRIzol directly to frozen tissue then homogenize. 

Following homogenization, remove insoluble material from the homogenate by centrifugation at 
10 7500 X g for 15 min. in a Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge 
at 4°C. Transfer the cleared homogenate to a new tube(s). The samples may be frozen now at - 
60 to -70°C (and kept for at least one month) or you may continue with the purification. 

PHASE SEPARATION 

Incubate the homogenized samples for 5 minutes at room temperature. 
1 5 Add 0.2ml of chloroform per 1 ml of TRIzol reagent used in the original homogenization. 
Cap tubes securely and shake tubes vigorously by hand (do not vortex) for 15 seconds. 
Incubate samples at room temp, for 2-3 minutes. Centrifuge samples at 6500rpm in a Sorvall 
superspeed for 30 min. at 4'*C. (You may spin at up to 12,000 x g for 10 min. but you risk 
breaking your tubes in the centrifuge.) 

20 RNA PRECIPITATION 

Transfer the aqueous phase to a fresh tube. Save the organic phase if isolation of DNA or protein 
is desired. Add 0.5ml of isopropyl alcohol per 1ml of TRIzol reagent used in the original 
homogenization. Cap tubes securely and invert to mix. Incubate samples at room temp, for 10 
minutes. Centrifuge samples at 6500rpm in Sorvall for 20min. at 4°C. 

25 RNA WASH 

Pour off the supemate. Wash pellet with cold 75% ethanol. Use 1ml of 75% ethanol per 1 ml of 
TRIzol reagent used in the initial homogenization. Cap tubes securely and invert several times to 
loosen pellet. (Do not vortex). Centrifuge at <8000rpm (<7500 x g) for 5 minutes at 4°C. 
Pour off the wash. Carefully transfer pellet to an eppendorf tube (let it slide down the tube into the 
3 0 new tube and use a pipet tip to help guide it in if necessary). Depending on the volumes you are 
working with, you can decide what size tube(s) you want to precipitate the RNA in. When I tried 
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leaving the RNA in the large 15ml tube, it took so long to dry (i.e. it did not dry) that I eventually 
had to transfer it to a smaller tube. Let pellet dry in hood. Resuspend RNA in an appropriate 
volume of DEPC H2O. Try for 2-5ug/ul. Take absorbance readings. 

Purifv polv A+ mRNA from total RNA or clean up total RNA with Qiaaen' s 
5 RNeasv kit 

Purification of poly A* mRNA from total RNA. Heat oligotex suspension to 37°C and mix 
immediately before adding to RNA. Incubate Elution Buffer at 70°C. Warm up 2 x Binding Buffer 
at 65°C if there is precipitate in the buffer. Mix total RNA with DEPC-treated water, 2 x Binding 
Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook. Incubate for 3 
1 0 minutes at 65°G. Incubate for 10 minutes at room temperature. 

Centrifuge for 2 minutes at 14,000 to 18,000 g. If centrifuge has a "soft setting," then use it. 
Remove supematant without disturbing Oligotex pellet. A little bit of solution can be left behind to 
reduce the loss of Oligotex. Save sup until certain that satisfactory binding and elution of poly A* 
mRNA has occun-ed. 

1 5 Gently resuspend in Wash Buffer 0W2 and pipet onto spin column. Centrifuge the spin column 
at full speed (soft setting if possible) for 1 minute. 

Transfer spin column to a new collection tube and gently resuspend in Wash Buffer 0W2 and 
centrifuge as describe herein. 

Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70°C) Elution Buffer. 
2 0 Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as above. Repeat elution 
with fresh elution buffer or use first eluate to keep the elution volume low. 

Read absorbance, using diluted Elution Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA must be precipitated. 
Some component leftover or in the Elution Buffer from the Oligotex purification procedure will 
2 5 inhibit downstream enzymatic reactions of the mRNA. 

Ethanol Precipitation 

Add 0.4 vol. of 7.5 M NH4OAC + 2.5 vol. of cold 100% ethanol. Precipitate at -20°C 1 hour to 
ovemight (or 20-30 min. at -70°C). Centrifuge at 14,000-16,000 x g for 30 minutes at 4°C. Wash 
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pellet with 0.5ml of 80%ethanol (-20°C) then centrifuge at 14,000-16,000 x g for 5 minutes at room 
temperature. Repeat 80% ethanol wash. Dry the last bit of ethanol from the pellet in the hood. 
(Do not speed vacuum). Suspend pellet in DEPC H2O at lug/ul concentration. 

Clean up total RNA using Qiaqen's RNeasv kit 
5 Add no more than lOOug to an RNeasy column. Adjust sample to a volume of 100ul with RNase- 
free water. Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do 
not centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 1 5 sec at 
>10,000rpm. If concemed about yield, re-apply flowthrough to column and centrifuge again. 
Transfer column to a new 2-ml collection tube. Add 500ul Buffer RPE and centrifuge for 15 sec 

10 at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and centrifuge for 15 sec at 

>10,000rpm. Discard flowthrough then centrifuge for 2 min at maximum speed to dry column 
membrane. Transfer column to a new 1.5-ml collection tube and apply 30-50ul of RNase-free 
water directly onto column membrane. Centrifuge 1 min at >10,000rpm. Repeat elution. 
Tal<e absorbance reading. If necessary, ethanol precipitate with ammonium acetate and 2.5X 

15 volume 100% ethanol. 

Make cDNA using Gibco's "Superscript Choice System for cDNA Svnthesis" kit 
First Strand cDNA Synthesis 

Use 5ug of total RNA or 1 ug of polyA+ mRNA as starting material. For total RNA, use 2ul of 
Superscript RT. For polyA+ mRNA, use 1 ul of Superscript RT. Final volume of first strand 
2 0 synthesis mix is 20ul. RNA must be in a volume no greater than 1 Qui. Incubate RNA with 1 ul of 
lOOpmol T7-T24 oligofor 10 min at70C. On ice, add 7 ul of: 4ul5X1" Strand Buffer, 2ul of 
0.1 M DTT, and 1 ulof lOmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT 
Incubate at 37C for 1 hour. 

Second Strand Svnthesis 

2 5 Place 1*' strand reactions on ice. 

Add: 91UIDEPCH20 

30ul 5X 2"" Strand Buffer 

3ul 10mM dNTP mix 

lul lOU/ul £.co// DNA Ligase 

3 0 4ul lOU/ul £.co// DNA Polymerase 

1ul2U/ul RNaseH 

Make the above into a mix if there are more than 2 samples. Mix and incubate 2 hours at 16C. 
61 



wo 01/11086 



PCT/USOO/22061 



Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C. Add 10ul of 0.5M EDTA 

Clean up cDNA 

Phenol:Chloroform:lsoamyl Alcohol (25:24:1) purification using Phase-Lock gel tubes: 
Centrifuge PLG tubes for 30 sec at maximum speed. Transfer cDNA mix to PLG tube. Add equal 
5 volume of phenol:chlaroform:isamyl alcohol and shake vigorously (do not vortex). Centrifuge 5 
minutes at maximum speed. Transfer top aqueous solution to a new tube. Ethanol precipitate: 
add 7.5X 5M NH40ac and 2.5X volume of 100% ethanol. Centrifuge immediately at room temp, 
for 20 min, maximum speed. Remove sup then wash pellet 2X with cold 80% ethanol. Remove 
as much ethanol wash as possible then let pellet air dry. Resuspend pellet in 3ul RNase-free 



In vitro Transcription (IVT) and labeling with biotin 
Pipet 1.5ul of cDNA into a thin-wall PGR tube. 

Make NTP labeling mix: 

Combine at room temperature: 2ul T7 lOxATP (75mM) (Ambion) 

2ul T7 1 0xGTP (75mlVI) (Ambion) 

1 .5ul T7 lOxCTP (75mM) (Ambion) 

1 .5ul T7 1 0xUTP (75mM) (Ambion) 

3.75ul lOmM Bio-1 1-UTP (Boehringer-Mannheim/Roche or 
Enzo) 

3.75ul 1 0mM Bio-1 6-CTP (Enzo) 

2ul lOx T7 transcription buffer (Ambion) 

2ul 10x T7 enzyme mix (Ambion) 

Final volume of total reaction is 20ul. Incubate 6 hours at 37C in a PGR machine. 

RNeasv clean-up of IVT product 

Follow previous instructions for RNeasy columns or refer to QIagen's RNeasy protocol handbook. 

cRNA will most likely need to be ethanol precipitated. Resuspend in a volume compatible with 
the fragmentation step. 
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Fragmentation 
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15 ug of labeled RNA is usually fragmented. Try to minimize the fragmentation reaction volume; 
a 10 ul volume is recommended but 20 ul is all right. Do not go higher than 20 ul because the 
magnesium in the fragmentation buffer contributes to precipitation in the hybridization buffer. 
Fragment RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer. 

5 5 X Fraamentatlon buffer: 

200 mM Tris-acetate, pH 8.1 
500 mM KOAc 
150 mM MgOAc 

The labeled RNA transcript can be analyzed before and after fragmentation. Samples can be 
1 0 heated to 65C for 1 5 minutes and electrophoresed on 1 % agarose/TBE gels to get an 
approximate idea of the transcript size range 

Hybridization 

200 ul (lOug cRNA) of a hybridization mix is put on the chip. If multiple hybridizations are to be 
done (such as cycling through a 5 chip set), then it is recommended that an initial hybridization 
1 5 mix of 300 ul or more be made. 

Hybrization Mix: fragment labeled RNA (50ng/ul final cone.) 
50 pM 948-b control oligo 
1.5 pM BioB 
5 pM BloC 
2 0 25 pM BioD 

100 pM CRE 

O.lmg/ml hening sperm DNA 

0.5mg/ml acetylated BSA 

to 300 ul with IxMES hyb. buffer 

2 5 The instruction manuals for the products used herein are incorporated herein in their entirety. 

Labeling Protocol Provided Herein 

Hybridization reaction: 

Start w^ith non-biotinylated IVT (purified by RNeasy columns) 
(see example 1 for steps from tissue to IVT) 
30 IVT antisense RNA; 4 |jg: |jl 
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Random Hexamers (1 |Jg/|Jl): 4 |Jl 
H2O: |Jl 

14 Ml 



5 - Incubate 70°C, 10 min. Put on ice. 

Reverse transcription: 

5X First Strand (BRL) buffer: 6 pi 



0.1MDTT: 3 pi 

SOX dNTP mix: 0.6 |jl 

10 H20: 2.4 Ml 

Cy3orCy5dUTP(1mM): 3 pi 

SS RT II (BRL): 1 Ml 



16 Ml 

15 - Add to hybridization reaction. 

- Incubate 30 min., 42''C. 

- Add 1 M' SSII and let go for another hour. 

Put on ice. 

- SOX dNTP mix (2SmlVl of cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 \M each of lOOmM 

2 0 dATP, dCTP, and dGTP; 10 Ml of lOOmM dTTP to 15 Ml H20. dNTPs from Pharmacia) 

RNA degradation: 

86 Ml H2O 

-Add 1.5 Ml 1M NaOH/2mM EDTA, incubate at65°C. 10 min. 10 Ml 1 0N NaOH 

4 Ml 50mM EDTA 

25 U-Con 30 

SOO Ml TE/sample spin at 7000g for 10 min, save flow through for purification 

Qiagen purification: 

-suspend u-con recovered material in SOOmI buffer PB 
-proceed w/ nomnal Qiagen protocol 

3 0 DNAse digest: 

- Add 1 Ml of 1/100 dil of DNAse/30Ml Rx and incubate at 37°C for 1 5 min. 
-5 min 95°C to denature enzyme 
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Sample preparation: 

- Add: 

Cot-1 DNA: 10 |Jl 
50X dNTPs: 1 (Jl 
20X SSC: 2.3 ^ll 
Na pyro phosphate: 7.5 |jl 
10mg/ml Herring sperm DNA 1ul of 1/10 dilution 
21.8 final vol. 

- Dry down In speed vac. 

- Resuspend in 15 |Jl H2O. 
-Add 0.38 Ml 10%SDS. 

- Heat 95°C, 2 min. 

- Slow cool at room temp, for 20 min. 

Put on slide and hybridize ovemight at 64°C. 

Washing after the hybridization: 

3X SSC/0.03% SDS: 2 min. 37.5 mis 20X SSC+0.75mls 10% SDS in 250mls HjO 

1 X SSC: 5 min. 1 2.5 mis 20X SSC in 250mls HjO 

0.2X SSC: 5 min. 2.5 mis 20X SSC in 250mls H2O 

Dry slides in centrifuge, 1000 RPM, 1min. 
Scan at appropiate PMT's and channels. 



The results are shown in the tables and figures. The lists of genes come from cells cultured in an 
in vitro angiogenesis model. As indicated, some of the Accession numbers include expression 
sequence tags (ESTs). Thus, in one embodiment herein, genes within an expression profile, also 
termed expression profile genes, include ESTs and are not necessarily full length. 
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TABLE 1 





Accession #/ 

r KUDtot 1 


Gene Descri tion 




AB000450 


vaccLaTe^a'ted kinase 2 


4 


AB002380 


rumarmRNA for k1^382 ene' artial cds 




AB0031 03 


proteasome (prosome macropa^n)^26S subunit- non-ATPase" 12 


4 


AB004884 


Homo sapiens mRNA for PKU-alpha; partial cds 


1 


AF000573 ma1 


homogentisate 1 ;2-dioxygenase (homogentisate oxidase) 


3 


AF008937 


Homo sapiens syntaxin-16C mRNA, complete cds 


3 


AF009301 


Homo sapiens TEB4 protein mRNA; complete cds 


3 


AF009368 


Homo sapiens Luman mRNA; complete cds 




D00591 







D00760 


^roteasomTf'irsomrm^^^^ ain) subunit al ha t e' 2 

proteasome (prosome macropain) su uni a p a ype, ^ 




1 


D11139 


tissue inhibitor of metalloproteinase 1 (erythroid potentiatinQ activity; 
collagenase inhibitor) 


4 


D14657 


Human mRNA for KIAA1 1 gene; complete cds 


4 


D14878 


D123 gene product 


1 


D17716 


mannosyl (alpha-1 ;6-)-glycoprotein 
beta-1;6-N-acetyl-glucosaminyltransferase 


4 


D21090 


RAD23 (S. cerevisiae) homolog B 


1 


D26135 


diacylglycerol kinase; gamma (9kD) 




D26528 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 7 (RNA helicase; 52kD) 


^ 






4 


D31762 


Human mRNA for KIAA57 gene; complete cds 


4 


D31765 


Human mRNA for KIAA61 gene; partial cds 


3 


D31888 


Homo sapiens clone 2479 mRNA sequence 


4 


D38128 


prostaglandin 12 (prostacyclin) receptor (IP) 


2 


D38500 


postmeiotic segregation increased 2-like 4 


4 


D38551 


RAD21 (S. pombe) homolog 


4 


D42087 


Human mRNA for KIAA1 18 gene; partial cds 


3 


D49396 


Human mRNA for Apo1_Human (MER5(Aop1-Mouse)-like protein); 

complete cds 


4 


D55640 


Human monocyte PABL (pseudoautosomal boundary-like sequence) 
mRNA, clone Mo2 




D63391 


platelet-activating factor acetylhydrolase; isoform lb; gamma subunit 






Human mRNA for KIAA143 gene- partial cds 


4 


D63483 


acetyl LDL receptor; SREC 


4 


D64015 


TIA1 cytotoxic granule-associated RNA-binding protein-like 1 


4 


D79990 


Human mRNA for KIAA168 gene; complete cds 


4 


D79997 


Human mRNA for KIAA175 gene; complete cds 


4 


D80010 


Human mRNA for KIAA188 gene; partial cds 


1 


D84276 


CD38 antigen (p45) 


4 


D86425 


Homo sapiens mRNA for nidogen-2 


4 


D86978 


Human mRNA for KIAA225 qene; partial cds 
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15 



Cluster 


Accession #/ 
PROBESET 


Gene Description 




D87012 


Homo sapiens clone 24675 mRNA sequence 





D87075 


Human mRNA for KIAA238 gene; partial cds 




4 


D87432 


solute carrier family 7 (cationic amino acid transporter; y+ system); 


4 


D87448 


Homo sapiens mRNA for DNA topoisomerase II binding protein; complete 
cds 


2 


D87845 


platelet-activating factor acetylhydrolase 2 (4kD) 


1 


HG1098-HT109a 


Cystatin D 


4 


HG2167-HT2237 


Protein Kinase Ht31, Camp-Dependent 


1 


HG2415-HT2511 


Transcription Factor E2f-2 


1 


HG2825-HT2949 


Ret Transforming Gene 


1 


HG2887-HT3031_r 


Sry-Related Hmg-Box 12 Protein (Gb:X73039) 


4 


HG4660-HT5073 


Microtubule-Associated Protein lb 


3 


HG4704-HT5146 


Glial Growth Factor 2 


4 


HG884-HT884 


Oncogene E6-Ap, Papillomavirus 


1 


HG919-HT919 


Dna Polymerase, Epsllon, Catalytic Subunit 


4 


J00212 f 


Accession not listed in Genbank 


4 


J04029 


keratin 1 (epidermolytic hyperkeratosis; keratosis palmaris et plantaris) 


4 


J04031 


5;1-methylenetetrahydrofolate dehydrogenase; 

5; 1 -methylenetetrahydrofblate cyclohydrolase; 1 -formyltetrahydrofolate 

synthetase 


4 


J0408a 


topoisomerase (DNA) II alpha (17kD) 


4 


J04543 


annexin VII (synexin) 


4 


L06139 


TEK tyrosine kinase; endothelial 


1 


L07540 


ACTIVATOR 1 36 KD SUBUNIT 


4 


L08895 


MADS box transcription enhancer factor 2; polypeptide C (myocyte 
enhancer factor 20) 


1 


L11239 


gastrulation brain homeo box 1 


1 


L11353 


neurofibromin 2 (bilateral acoustic neuroma) 


4 


L 13773 


Human AF-4 mRNA; complete cds 


4 


L13800 


Homo sapiens liver expressed protein gene, 3' end 


4 


L 14922 


replication factor C (activator 1) 1 (145kD) 


4 


L15189 


heat shock 7kD protein 9B (mortalin-2) 


4 


LI 5388 


Human G protein-coupled receptor kinase (GRK5) mRNA, complete cds 


3 


LI 6895 


lysyl oxidase 


4 


L27476 


Friedreich ataxia region gene X14 (tight junction protein ZO-2) 


4 


L27624 


TISSUE FACTOR PATHWAY INHIBITOR 2 PRECURSOR 


1 


L32976 


mixed lineage kinase 3 


1 


L33404 


protease; serine; 6 (chymotryptic; stratum comeum) 


4 


L35263 


cytokine suppressive anti-inflammatory drug binding protein 1 (p38 MAP 

kinase) 


1 


L37347 


natural resistance-associated macrophage protein 2 




L40371 


thyroid hormone receptor interactor 4 


4 


L40391 


Homo sapiens (clone s153) mRNA fragment 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 




L41607 


glucosaminyl (N-acetyl) transferase 2; l-branching enzyme 




1 


L77566 


Homo sapiens DGS-I mRNA; 3' end 


1 


M13928 


aminolevulinate; delta-; dehydratase 


1 


M14016 


uroporphyrinogen decarboxylase 


4 


M14219 


decorin 


4 


M 15796 


proliferating cell nuclear antigen 


4 


M21305 


Human alpha satellite and satellite 3 junction DNA sequence 


4 


M22092 


Human neural cell adhesion molecule (N-CAM) gene, exon SEC and 
partial cds 


4 


M22898 


tumor protein p53 (Li-Fraumeni syndrome) 


3 


M22995 


RAP1A; member of RAS oncogene family 


3 


M23379 


RAS p21 protein activator (GTPase activating protein) 1 


1 


M24364 


major histocompatibility complex; class II; DQ beta 1 


1 


M24400 


chymotrypsinogen B1 


3 


M25753 


cyclin B1 




M27691 


cAMP responsive element binding protein 1 




M28213 


RAB2; member RAS oncogene family 


4 


M29550 


SERINE/THREONINE PROTEIN PHOSPHATASE 2B CATALYTIC 
SUBLIMIT; BETA ISOFORM 




M29971 


0-6-methylguanine-DNA methyltransferase 


4 


M30269 


nidogen (enactin) 


4 


M31158 


protein kinase; cAMP-dependent; regulatory; type II; beta 


3 


M31166 


pentaxin-related gene; rapidly induced by IL-1 beta 


3 


M31210 


endothelial differentiation; sphingolipid G-protein-coupled receptor; 1 


1 


M55420 


Human IgE chain, last 2 exons 


4 


M59979 


prostaglandin-endoperoxide synthase 1 (prostaglandin G/H synthase and 
cyclooxygenase) 


4 


M62810 


transcription factor 6-like 1 (mitochondrial transcription factor 1-like) 


4 


M63838 


interferon; gamma-inducible protein 16 


1 


M64710 


Human C-type natriuretic peptide gene, complete cds 


3 


M68874 


Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, complete 
cds 


3 


M74524 


ubiquitin-conjugating enzyme E2A (RAD6 homolog) 




M80254 


PEPTIDYL-PROLYL CIS-TRANS ISOMERASE; MITOCHONDRIAL 
PRECURSOR 


1 


M81780 cds3 


sphingomyelin phosphodiesterase 1 ; acid lysosomal (acid 
sphingomyelinase) 




IVI83822 


Human beige-like protein (BGL) mRNA; partial cds 


^ 


IV186934 


GS1 PROTEIN 


1 


IV187338 


replication factor C (activator 1)2 (4kD) 


1 


IV196326_ma1 


azurocidin 1 (cationic antimicrobial protein 37) 


4 


IV196954 


TIA1 cytotoxic granule-associated RNA-binding protein-like 1 


4 


M98833 


Friend leukemia virus integration 1 


1 


S66793 


arrestin 3; retinal (X-arrestin) 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 






S72370 






4 


S78569 


laminin; alpha 4 






S79873 


lysosomal-associated membrane protein 2 






S83325 


aspartate beta-tiydroxylase 






S83364 


putative Rab5-lnteracting protein {clone L1-57} [human, HeLa cells, 
mRNA Partial 366 nt] 




1 


S83365 


putative RabS-interacting protein {clone L1-94} [human, HeLa cells, 
mRNA Partial, 369 nt] 




1 


U01212 


Human olfactory marker protein (OMP) gene, complete cds 




1 


U01922 


deafness; X-linked 1; progressive 




4 


U02556 


Human RP3 mRNA; complete cds 


10 


4 


U02680 


protein tyrosine kinase 9 




4 


U03272 


fibrillin 2 




4 


U04209 


Human associated microfibrillar protein mRNA; complete cds 




4 


U05237 


fetal Alzheimer antigen 




1 


U07225 


purinergic receptor P2Y; G-protein coupled; 2 


15 


3 


U07620 


protein kinase mitogen-activated 1 (MAP kinase) 




4 


U09759 


protein kinase mitogen-activated 9 (MAP kinase) 




4 


U09820 


alpha thalassemia/mental retardation syndrome X-linked 




3 


U11313 


sterol carrier protein 2 




3 


U14518 


centromere protein A (17kD) 


20 


4 


U14575 


protein phosphatase 1 ; regulatory (Inhibitor) subunit 8 




3 


U15173 


BCL2/adenovirus E1B 19kD-interacting protein 2 




4 


U 15932 


dual specificity phosphatase 5 




4 


U18291 


cell division cycle 16; anaphase promoting complex 6 




4 


U18300 


damage-specific DMA binding protein 2 (48kD) 


25 


4 


U18383 


nuclear respiratory factor 1 




4 


U20536 


caspase 6; apoptosis-related cysteine protease 




4 


U21551 


Human ECA39 mRNA; complete cds 




4 


U23028 


eukaryotic translation initiation factor 28; subunit 5 (epsilon; 82kD) 






U23752 


SRY (sex-determining region Y)-box 1 1 


30 


4 


U25435 


Human transcriptional repressor (CTCF) mRNA; complete cds 




4 


U25997 


stanniocalcin 




4 


U28251_cds2 


zinc finger protein 169 




4 


U28831 


Human protein immuno-reactive with anti-PTH polyclonal antibodies 
mRNA: partial cds 




4 


U30245 


Human myelomonocytic specific protein (MNDA) gene, 5' fianl<ing 
sequence and compiete axon 1 


35 


4 


U32315 


Human syntaxin 3 mRNA; complete cds 




4 


U32439 


regulator of G-protein signalling 7 




3 


U32849 


N-myc (and STAT) interactor 




4 


U35139 


necdin (mouse) homolog 




1 


U36764 


eukaryotic translation initiation factor 3; subunit 2 (beta; 36kD) 


40 


4 


U39400 


chromosome 1 1 open reading frame 4 




4 


U39657 


protein kinase; mitogen-activated; kinase 6 (MAP kinase kinase 6) 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 


4 


U41344 


proline arginine-rich end leucine-rich repeat protein 


3 


U41766 


a disintegrin and metalloproteinase domain 9 (meltrin gamma) 


3 


U41813 


homeo box A9 


3 


U41815 


Human nucleoporin 98 (NUP98) mRNA, complete cds 


4 


U43286 


Human selenophosphate synthetase 2 (SPS2) mRNA; complete cds 


4 


U44378 


MAD (mothers against decapentaplegic; Drosophila) homolog 4 


4 


U44754 


small nuclear RN A activating complex; polypeptide 1; 43kD 


1 


U47011 Cdsl 


fibroblast growth factor 8 (androgen-induced) 


4 


U47077 


Human DNA-dependent protein l<inase catalytic subunit (DNA-PKcs) 
mRNA; complete cds 


4 


U48251 


Homo sapiens protein l<inase C-binding protein RACK7 mRNA; partial cds 


4 


U50535 


Human BRCA2 region; mRNA sequence CG6 


4 


U56833 


von Hippel-Lindau binding protein 1 


4 


U58091 


cullin4B 


1 


U58837 


cyclic nucleotide gated channel beta 1 


4 


U59289 


cadherin 13; H-cadherin (heart) 


4 


U59863 


TNF receptor-associated factor 2 


4 


U67122 


ubiquitin-lil<e 1 (sentrin) 






caspase 7; apoptosis-related cysteine protease 





U68019 


MAD (mothers against decapentaplegic; Drosophila) homolog 3 




1 


U69611 


a disintegrin and metalloproteinase domain 17 (tumor necrosis factor; 
alpha; converting enzyme) 


4 


U70322 


l<aryopherin (importin) beta 2 


4 


U73524 


Human putative ATP/GTP-binding protein (HEAB) mRNA; complete cds 


4 


U79267 


Human clone 2384 mRNA; partial cds 


4 


U79291 


Human clone 23721 mRNA sequence 


4 


U82671_cds2 


Homo sapiens clone LM1955 H15e3 gene; partial cds 


4 


U84573 


procollagen-lysine; 2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2 




U90914 


carboxypeptidase D 




U91316 


Homo sapiens mRNA for brain acyl-CoA hydrolase; complete cds 


4 


U91932 


clathrin-associated/assembly/adaptor protein; small 3; 22-l<D; Slgma3A 




U96131 


Homo sapiens HPV16 El protein binding protein mRNA; complete cds 


4 


U97018 


echinoderm microtubule-associated protein-like 




4 


U97188 


Homo sapiens putative RNA binding protein KOC (l<oc) mRNA; complete 
cds 


4 


V00503 


collagen; type 1; alpha 2 


3 


X04327 


2;3-bisphosphoglycerate mutase 


1 


X06389 


synaptophysin 


1 


X07496 


apolipoprotein A-l 


2 


X07820 


matrix metalloproteinase 1 (stromelysin 2) 


3 


X14787 


thrombospondin 1 



71 



wo 01/11086 



PCT/USOO/22061 



Cluster 


Accession #/ 
PRO BESET 


Gene Description 


4 


X15525_rna1 


acid phosphatase 2; lysosomal 






NAD-DEPENDENT METHYLENETETRAHYDROFOLATE 
DEHYDROGENASE 




X16609 


ankyrin 1 ; erythrocytic 




X53586 rnal 


Human mRNA for integrin alpha 6 


^ 




MULTIFUNCTIONAL PROTEIN ADE2 


1 


X54936 


placental growth factor; vascular endothelial growth factor-related protein 


4 


X55740 


5' nucleotidase (CD73) 


2 


X57025 


lnsulin-lil<e growth factor 1 (somatomedin C) 


2 


X60673_rna1 


adenylate kinase 3 


1 


X60708 


dipeptidylpeptidase IV (CD26; adenosine deaminase complexing protein 
2) 




X62048 


wee1+ (8. pombe) homolog 





X63097 


Rhesus blood group; D snt gon 


2 


X63563 




4 


X64037 


general transcription factor IIP; polypeptide 1 (74kD subunit) 


4 


X69636 


hect domain and RLD 2 


4 


X69878 


fms-related tyrosine kinase 4 


4 


X70649 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 


3 


X72841 


H.sapienslEF 7442 mRNA 




X74987 


rib nude L 2'-5' oli oisoaden late s nthetase de endent inhibitor 
nbonuclease L (2 .5-oligoisoadenylate synthetase-dependent) inhibitor 





X83107 


BMX non-receptor tyrosine kinase 







acylphosphatase 1 erythrocyte (common) type 





X85753 


cyclin-depGndGnt kinssG 8 


^ 


X87870 






AoyuDO 


traTstenrre^e 'tor^°otential^^^^ ^^'^"'^ '^^ 

transient receptor poten la c anne 





X89398 cds2 


uracll-DNA glycosylase 




^ 


X89399 


Homo sapiens mRNA for lns(1;3;4;5)P4-binding protein 






H.sapiens mRNA for ESIVl-1 protein 


^ 


X91247 


thioredoxin reductase 1 


4 


X91648 


H.sapiens mRNA for pur alpha extended 3'untranslated region 


4 


X92098 


H.sapiens mRNA for transmembrane protein rnp24 


4 


X92110 


H.sapiens mRNA for hcgVIII protein 


4 


X94703 


RAB28; member RAS oncogene family 




X96506 


H.sapiens mRNA for NC2 alpha subunit 


1 


X97230_f 


Homo sapiens natural killer-associated transcript 5 (NKAT5) mRNA; 
complete cds 


4 


X98263 


H.sapiens mRNA for M-phase phosphoprotein; mpp6 


4 


X98296 


ubiquitin specific protease 9; X chromosome (Drosophila fat facets 
related) 


4 


X99584 


H.sapiens mRNA for SMT3A protein 
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Accession #/ 
PROBESET 


Gene Description 


f 


Y00264 


t A4 recursor roteinf roteasenexin II- Alzheimer disease) 


i 


X2Z^66 


H^a°ienrmRNAfor^RIT^Tot^^^ ' ^ ^"^^ ^^^^^ 


? 


X°II55 


m^(L^n VA meav ^"'^ T Vtide 'l'2- m oxin) 

myosin VA (tieavy po ypepti ^ ■ "^y"^'"' 


^ 


X°Z227 


Human butyrophilin (BTF5) mRNAj conipl6t6 cds 




Y07867 






1 


Y09443 


~ h h 


1 


Y09858 


alkylglyceranephosp^a^^^^ ^rotein 

H.sapiensmRNA orun n own protein 


1 


Y12394 


karyopherin alpha 3 (importin alpha 4) 


5 


£11^55 




1 


211^55 


pToTeTkinasI^mi'toge^ kinase V p4- p41) 




215005 


centromere protsin E (312kD) 





^IH^J 




^ 


AA01 1243 s 


H^sapiens DNA for histone H3a 


2 


AA018418 


ESTs 


2 


AA0 18758 


ESTs 


2 


AA0 18804 


Homo sapiens clone 23675 mRNA sequence 


3 


AA031993 


Homo sapiens HRIHFB2115 mRNA; partial cds 


2 


AA044217 


ESTs; Weal<ly similar to similar to cuticle collagen [C.elegans] 




AA046548 


SWI/SNF related; matrix associated; actin dependent regulator of 
chromatin; subfamily e; member 1 


2 


AA057447 s 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SB WARNING ENTRY 
!!!! [H. sapiens] 


2 


AA058376 


Sjogren syndrome antigen A2 (6kD; ribonucleoprotein autoantigen 
SS-A/Ro) 


4 


AA083572 


v-ral simian leukemia viral oncogene homolog A (ras related) 


4 


AA085696 


ESTs 


2 


AA088744 


ESTs 




AA089688 


ESTs; Weakly similar to putative T1/ST2 receptor binding protein 
precursor [H.saplens] 




AA091284 


ESTs 


2 


AA092700 


ESTs 


1 


AA092968 


ESTs 


4 


AA094800 


eukaryotic translation initiation factor 3; subunit 7 (zeta; 66/67kD) 


4 


AA100219 


ESTs 


4 


AA114885 


ESTs 


4 


AA129547 


ESTs; Weakly similar to III! ALU SUBFAMILY SQ WARNING ENTRY II!! 

[H. sapiens] 




AA133016 


ESTs 


3 


AA149507 


homolog of mouse quaking QKI (KH domain RNA binding protein) 


2 


AA151005 


sperm surface protein 




AA187101 


zp61b6.r1 Stratagene endothelial cell 937223 Homo sapiens cDNA clone 
IMAGE:624659 5'. mRNA sequence 


3 


AA195179 s 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 




2 


AA203138 


low density lipoprotein receptor (familial hypercholesterolemia) 




2 


AA203645 


ESTs; Moderately similar to SH3-containing protein p415 [R.norvegicus] 






AA206236 


zq54c6.r1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA 
clone IMAGE:645418 5" similar to TR:G 122922 G 122922 ALLOGRAFT 
INFLAMMATORY FAGTOR-1. ;, mRNA sequence 




^ 


AA227621 


ESTs; Weakly similar to weal< similarity to collagens [C.elegans] 


5 




AA248283 


ESTs; Weakly similar to X-linked retinopathy protein {G-tenninal; clone 
XEH.8c} [H.sapiens] 




3 


AA24961 1 


H.sapiens mRNA for 21-Glutamic Acid-Rich Protein (21-GARP) 




2 


AA282640 


ESTs 




2 


AA287199 


Human mRNA for KIAA81 gene; partial cds 




2 


AA313990 


ESTs; Highly similar to HYPOTHETICAL 3.5 KD PROTEIN C3A5.3 IN 
CHROMOSOME III [Caenorhabditis elegans] 


10 


2 


AA3 14256 


EST18611 Colon carcinoma (HOC) cell line II Homo sapiens cDNA 5' end, 
mRNA sequence 






AAo14ooy 


ESTs; Highly similar to ADP-RIBOSYLATION FACTOR 1 
[Saccharomyces cerevisiae] 






2 


AA324364 


ESTs; Moderately similar to !!!! ALU SUBFAMILY J WARNING ENTRY III! 
[H.sapiens] 




3 


AA329211_S 


ESTs 






A A'5QQ1 fl7 


ESTs 


15 





AA421079 






^ 


AA422029 


ES^ 




3 


AA425230 


Human GAP SH3 binding protein mRNA; complete cds 






AA447052 


ESTs; Highly similar to N-terminal asparagine amidohydrolase 
[M.musculus] 




4 


AA452000 


ESTs 


20 


4 


AA456687 


ESTs 




4 


AA487015 s 


ESTs; Weakly similar to X-linked retinopathy protein {C-terminal; clone 
XEH.Sc} [H.sapiens] 




2 


AB002326 


Human mRNA for KIAA328 gene; partial cds 




4 


AFFX-BioB-3 






2 


C01527 


ESTs 


25 


4 


C01714 


Homo sapiens serum-inducible kinase mRNA; complete cds 




3 


C01811_f 


ESTs 




2 


C02352_s 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SQ WARNING ENTRY 
!!!! [H.sapiens] 




1 


C02375 


Human mRNA containing an Alu repeat and its reverse complement 




2 


CI 4448 


EST 


30 


4 


D16611 s 


coproporphyrinogen oxidase (coproporphyria; harderoporphyria) 




2 


D25216 


Human mRNA for KIAA14 gene; complete cds 




2 


D31352 


ESTs; Weakly similar to hypothetical protein [H.sapiensl 
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PRO BESET 


Gene Description 


4 


D58024 s 


ESTs; Weakly similar to probable hormone receptor EMR1 precursor 
[H.sapiens] 


3 


D80897 


Homo sapiens clone 24736 mRNA sequence 




D82614 


ESTs 


4 


D87845 


piatelet-activating factor acetyl liydrolase 2 (4kD) 




D89377 i 


msh (Drosophiiia) homeo box homolog 2 




H06583 


cAMP responsive element binding protein-iike 2 




H40732 


ESTs 


4 


H46617 


yp19h1.r1 So3res bresst SNbHBst Homo S3piens cDNA clone 
IMAGE:187921 5', mRNA sequence 




H56731 


ESTs 




H75570 


ESTs 




H78886 


ESTs 


1 


H81241 


ESTs; Highly similar to ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION 
FACTOR [Mus musculus] 


1 


L36531 


integrin; alpha 8 


2 


M63154 


gastric intrinsic factor (vitamin B synthesis) 


4 


M63180 


threonyl-tRNA synthetase 


2 


M91504 


ESTs 


2 


N56191 


Homo sapiens protocadherin 68 (PCH68) mRNA; complete cds 


2 


N78483 


ESTs 


2 


N79268 


zinc finger protein 198 


2 


R14652 


Homo sapiens PAC clone DJ872F7 from 7q31 


2 


R20459 


yg33f12.r1 Soares infant brain 1NIB Homo sapiens cDNA clone 
IMAGE:34345 5". mRNA sequence 


3 


R22303 


ESTs; Weakly similar to putative p1 5 [H.sapiens] 


2 


R33779 


ESTs; Weakly similar to p4 [H.sapiens] 


2 


R36553 


ESTs; Weakly similar to KIAA681 protein [H.sapiens] 


2 


R64534 


ESTs 


4 


R66475 


ESTs 


4 


R70621 


Homo sapiens mRNA for KIAA896 protein; partial cds 


3 


R79356 


ESTs; Weakly similar to PROTEIN Q3 [Mus musculus] 


2 


R84933 


ESTs; Weakly similar to putative p15 [H.sapiens] 


3 


RC_AA007160 


ESTs 


2 


RC AA007234 s 


ESTs; Highly similar to protein tyrosine phosphatase epsilon cytoplasmic 
isoform [H.sapiens] 


2 


RC AA018409 


ESTs 


4 


RC AA025351 


ESTs 


3 


RC AA027168 


ESTs 


1 


RC AA027317 


ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 


3 


RC AA029423 


ESTs 


4 


RC AA031357 


ESTs 


4 


RC_AA045136 


ESTs 


1 


RC AA053400 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 




3 


RC AA055829 


ESTs; Weakly similar to 111! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 




3 


RC AA065217 


ESTs 




1 


RC AA 116054 


ESTs 




1 


RC_AA 126311 


ESTs 


5 


4 


RC AA1 29390 


ESTs 




4 


RC AA1 30273 


ESTs; Highly similar to DOSAGE COMPENSATION REGULATOR 
[Drosophila melanogaster] 




2 


RC AA142919 


ESTs 




4 


RC AA1 50205 


ubiquitous Kruppel-llke transcription factor 




1 


RC AA1 76867 


ESTs 


10 




RC AA 180321 


ESTs; Highly similar to U1 small nuclear ribonucleoprotein 1SNRP 
homolog [H.sapiens] 







RC AA 180487 






^ 


RC AA 187634 


euk^oticTranlatoTiniH^^^^ 




3 


RC AA1 95399 


ESTs 




3 


RC AA234717 


ESTs 


15 


4 


RC_AA234743 


ESTs 




3 


RC_AA234957 


Homo sapiens mRNA for MTMR1 protein 




3 


RC AA235604 


ESTs 




3 


RC AA236559 


ESTs; Weakly similar to PROBABLE E5 PROTEIN [Human papillomavirus 

type 58] 




3 


RC AA242868 


ESTs; Weakly similar to house-keeping protein [M.musculus] 


20 


4 


RC AA251776 


jun D proto-oncogene 




4 


RC AA251909 


Homo sapiens protein kinase homolog (BUBR1) mRNA; complete cds 






RC AA252672 S 


diptheria toxin resistance protein required for diphthamide biosynthesis 







RC AA256157 


(^^ccharomyces)-like 2 ___ 

— _! 







RC AA256680 


! 


25 





RC AA2 58873 








^ 


RC AA262727 


EST^ 

— ! 






RC AA281451 


£_S 







RC AA281545 









RC AA282069 


Homosa lens mRNA for KIAA63 rotein- com lete cds 

omo sapiens m or pro em. comp e e c s 


30 




— ^! — 


RC AA283044 


i_! 






RC AA283930 


^— ^ — : — \ 




— - — 


RC AA284755 


ESTsj Weakly similsrto unknown [H.sspiens] 




— - — 


RC AA291268 


^^l! 




— - — 


RC AA291927 






— ^ — 


RC AA343514 


EST^ 




3 


RC AA398109 


ESTs 




4 


RC AA405737 


ESTs 




4 


RC_AA406610 


ESTs 




4 


RC_AA4 11465 


ESTs 


40 


3 


RC AA416886 


ESTs; Weakly similar to predicted using Genefinder TCeieqansl 
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RC AA424013 


Homo sspiens clone 23767 and 23782 mRNA sequences 






1 


RC AA424148 








RC AA424558 


ESTs; Weakly similar to 33-kDa phototransducing protein [H.sapiens] 







RC AA424961 S 


Homo sapiens TEB4 protein mRNlA; complete cds 


5 





RC AA425367 








1 


RC AA425921 


Homo sapiens 1 1 receptor candidate protein mRNA- complete cds 






RC AA426220 


Homo sapiens mRNA for KIAA523 proteinj partial cds 






1 










RC AA430673 


ESTs 


10 





RC AA432248 


^g^^ 







RC AA435896 


EST^ ~ 







RC AA436705 


Homo sa lens mRNA for KIAA766 rotein' complete cds 







RC AA446561 


Homo sapiens mRNA for KIAA47 protein^complete cds 




— 2 — 


RC AA448238 


Homo sapiens mRNA for KIAA91 5 protein; complete cds 


15 


— 5 — 


RC_AA448688 


ESTs' Weakly similar to KIAA638 protein [H.sapiens] 






RC AA449756 


ESTs, Weal<ly similar to rA8 [R.norvegicus] 




— ^ — 


RC AA450303 








RC AA45241 1 


iii^ ~ 




— - — 


RC AA454566 


ribosomal rotelnL13 ~~ 


20 




RC AA454667 


Es'ts"'^^ ' 






RC AA4 56437 


iiir^ ~ 




4 


RC AA456646 


ESTs 






RC AA456826 


ESTs 




4 


RC AA456981 


ESTs 


25 


4 


RC AA458959 


ESTs 




3 


RC AA459950 


ESTs 






RC AA460449 


ESTs; Highly similar to PROBABLE PHOSPHOSERINE 
AiVIINOTRANSFERASE [Oryctolagus cuniculus] 







RC AA463910 








1 


RC AA464603 


ESTs ~ 


30 




RC AA464606 


EST^ ~~~ 







RC AA465093 


TIA1 cytotoxic ranule-associated RNA-bindin rotein 







RC AA465692 


Homo^sa°rnsmRrA^forK°AAM8 rotein" artia^cds" 







RC AA476473 


Homo 53^16115 Trio mRNA- com letrcds"' 




i 


RC AA478109 


ESTs°^^^'^"^ """^ . comp 8 8 c s 


35 




RC AA478474 


ESTs 




3 


RC AA480889 


Esri ~ — ~ 







RC AA485223 


ESTs ~ 




1 


RC AA485254 


ESTs 




4 


RC AA486183 


ESTs; Weakly similar to Ytirlwp [S.cerevisiae] 


40 


3 


RC_AA496936 


ESTs; Weakly similar to B cell growtti factor [H.sapiens] 




4 


RC AA598589 


ESTs 




4 


RC AA598831 f 


ESTs 




4 


RC AA600150 


ESTs 
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Accession #/ 
PROBESET 
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4 


RC AA608545 


ESTs 


3 


RC AA609210 


ESTs 


3 


RC AA610108 


ESTs; Highly similar to PROBABLE PEPTIDYL-PROLYL CIS-TRANS 
ISOMERASE C21E11.5C [Schizosaccliaromyces pombe] 


4 


RC AA620582 


ESTs; Wealcly similar to (defline not available 424227) [H.sapiens] 




RC AA621239 


ESTs; Highly similar to HYPOTHETICAL 98.3 KD PROTEIN R1E12.1 IN 
CHROMOSOME III [Caenorhabditis elegans] 





RC AA621714 


ESTs 




RC AA621718 


ESTs 




RC D 19673 


ESTs 


i 


RC D25755 s 


ESTs 


1 


RC D51095 


ESTs 


4 


RC D60272 i 


ESTs; Weakly similar to macrophage lectin 2 [H.sapiens] 


2 


T08879 


cathepsin F 


3 


T34527 


UDP-N-acetyl-alpha-D-galactosamine:polypeptide 
N-acetylgalactosaminyltransferase 1 (GalNAc-TI) 


2 


T40327 s 


ESTs 


3 


T62771 s 


Homo sapiens nucleoplasmin-3 (NPM3) mRNA; complete cds 




T63174 S 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 


^ 


T83444~ 


Homo sapiens mRNA for KIAA887 protein; partial cds 


i 


193641 


ESTs 


^ 


U48263 


prepronociceptin 





U49065 


interleukin 1 receptor-like 2 




U79300 


Human clone 23629 mRNA sequence 


— ^ — 


U88573 


Human NBR2 mRNA; complete cds 




U93867 


Human RNA polymerase III subunit (RPC62) mRNA; complete cds 


4 


W0 1094 


ESTs 


2 


W0 1568 


ESTs 


2 


W26853 


ESTs 


2 


W27179 


BCL2/adenovirus E1B 19kD-interacting protein 3-like 


2 


W27965 


epimorphin 




W36280 s 


Homo sapiens RRM RNA binding protein Gry-rbp (GRY-RBP) mRNA; 
complete cds 




W47063 


ESTs 




W79060 


ESTs; Weakly similar to Ras-binding protein SUR-8 [M.musculus] 




W88550 


ESTs; Moderately similar to trg gene product [R.norvegicus] 





X60486 


H4 histone family; member G 




X78931 s 


H.sapiens HZF8 mRNA for zinc finger protein 




Z14077_s 


YY1 transcription factor 




RC AA002147 


EST 




RC AA004711 


ESTs 




RC AA010383 


EST 




RC AA015761 


ESTs 
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2 


RC AA0 18772 


ESTs 


2 


RC AA021473 r 


EST 




RC AA024835 


potassium voltage-gated channel; delayed-rectifier; subfamily S; member 




RC AA025858 


ESTs 




^ 


RC AA027229 


ESTs ~ 


^ 


RC AA029428 


isTs ~ 




RC AA035143 


ESTs 




RC AA035237 







RC AA039347 


ES? ~ 


\ 




^g.^.^ 




RC AA041551 


iirs ~ 





RC AA045513 


isTs 




RC AA045745 


iir^ 





RC AA055348 


ESTs 




RC AA056582 s 


ESTs 




RC AA056697 


ESTs 




RC AA056746 


EST 


3 


RC AA057678 


ESTs 


2 


RC AA058681 


ESTs 


2 


RC AA058686 


ESTs 


2 


RC AA062840 


zm5c1.s1 Stratagene corneal stroma (#937222) Homo sapiens cDNA 
clone IMAGE:513234 3' similar to gb:S71381 PROTEASOME BETA 
CHAIN (HUMAN);, mRNA sequence 


2 


RC AA064859 


zm5f3.s1 Stratagene fibroblast (#937212) Homo sapiens cDNA clone 
IMAGE;52985 3', mRNA sequence 


1 


RC AA065069 


zm12e11.s1 Stratagene pancreas (#93728) Homo sapiens cDNA clone 
IMAGE:525452 3', mRNA sequence 


1 


RC AA069923 


zm67g3.s1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA 
clone IMAGE:5374 3' similar to gb:S66915 cdsl ATP SYNTHASE 
GAMMA CHAIN, MITOCHONDRIAL PRECURSOR (HUMAN);, mRNA 
sequence 


2 


RC AA070799 S 


zm6h5.s1 Stratagene fibroblast (#937212) Homo sapiens cDNA clone 
IMAGE:5373 3", mRNA sequence 


2 


RC AA070815 


zm6a1.s1 Stratagene fibroblast (#937212) Homo sapiens cDNA clone 
IMAGE:529992 3' similar to gb:X67951 PROLIFERATION-ASSOCIATED 
PROTEIN PAG (HUMAN);, mRNA sequence 


2 


RC AA075374 


zm87a1 .s1 Stratagene ovarian cancer (#937219) Homo sapiens cDNA 
clone IMAGE:544872 3', mRNA sequence 


2 


RC AA076382 


zm91g8.s1 Stratagene ovarian cancer (#937219) Homo sapiens cDNA 
clone IMAGE:545342 3', mRNA sequence 


1 


RC AA078787 


ESTs 
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2 


RC AA078986 


zna92h1 .s1 Stratagene ovarian cancer (#937219) Homo sapiens cDNA 
clone IMAGE:545425 3", mRNA sequence 


1 


RC AA079393 


zm95h1 1 .si Stratagene colon HT29 (#937221 ) Homo sapiens cDNA 
clone IMAGE:545733 3' similar to gb:X1656 CYTOCHROME C OXIDASE 
POLYPEPTIDE VUG PRECURSOR (HUMAN);, mRNA sequence 


2 


RC AA079487 


zm97f8.s1 Stratagene colon HT29 (#937221) Homo sapiens cDNA clone 

IMAGE:545895 3', mRNA sequence 


2 


RC AA083207 


EST 




RC AA083256 




2 


RC AA084415 


zn6g9.s1 Stratagene hNT neuron (#937233) Homo sapiens cDNA clone 
IMAGE;546688 3', mRNA sequence 




RC AA085274 


znlfl.sl Stratagene colon HT29 (#937221) Homo sapiens cDNA clone 
IMAGE:546169 3' similar to gb:X15341 CYTOCHROME C OXIDASE 
POLYPEPTIDE VIA-LIVER (HUMAN);, mRNA sequence 


^ 


RC AA088678 


ESTs 


3 


RC AA 100925 


ESTs; Weakly similar to predicted using Genefinder [C.elegans] 






ESTs; Highly similarto J KAPPA-RECOMBINATION SIGNAL BINDING 


3 




stamlocalcinT" ^^'''^"^^ ~ 


2 


RC AA127017 


ESTs 


2 


RC AA1 29968 


ESTs; Weakly similar to protein phosphatase 2A 13 kOa regulatory 
subunit [H.sapiens] 


2 


RC AA 130240 


ESTs 


1 


RC AA131866 


ESTs 


2 


RC AA1 32039 


ESTs; Moderately similarto !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 


3 


RC AA1 32983 


ESTs; Moderately similar to C-1-TETRAHYDROFOLATE SYNTHASE; 

CYTOPLASMIC (Saccharomyces cerevisiae] 


3 


RC_AA 133250 


ESTs; Weakly similar to NADH-UBIQUINONE OXIDOREDUCTASE 
CHAIN 4 [Caenorhabditis elegans] 


1 


RC AA1 33583 s 


high-mobility group (nonhistone chromosomal) protein isoform l-C 




RC AA135941 


ESTs 


2 


RC AA1 48650 


zo9e6.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens 
cDNA clone IMAGE:56722 3', mRNA sequence 


2 


RC AA151110 


ESTs 


2 


RC AA1 55754 


ESTs; Moderately similarto III! ALU SUBFAMILY SX WARNING ENTRY 

!!!! [H.sapiens] 


4 


RC AA156125 


ESTs 


2 


RC AA1 56289 


ESTs 


1 


RC AA1 56997 


ESTs 


2 


RC AA1 57291 


ESTs 
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2 


RC AA1 57293 


ESTs 


2 


RC AA164293_f 


ESTs 


1 


RC AA1 64676 


EST 


1 


RC AA 167375 


Homo sapiens mRNA for KIAA53 protein; partial cds 


1 


RC AA1 67550 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SX WARNING ENTRY 

1!!! [H. sapiens] 


2 


RC AA1 76589 


EST 


1 


RC AA1 80448 


EST 




RC AA187144 s 


endothelin 1 


3 


RC AA189170 f 







RC AA1 92757 


ESTs 


2 


RC AA205650 


ESTs 


4 


RC AA233342 


ESTs; Weakly similar to neural differentiation-associated protein 

[M.musculus] 


3 


RC AA233472 


ESTs 


2 


RC AA234110 


ESTs 


4 


RC D80981 


ESTs 


3 


RC F01660 


ESTs; Weakly similar to HYPOTHETICAL PROTEIN HI34 [Haemophilus 
influenzae] 




RC F02206 


EST; Highly similar to ether-a-go-go-related protein [H. sapiens] 


4 


RC F02208 







RC F02544 


EST^ ~ 





RC F03918 


ESTs 


4 


RC F04258 s 


ESTs; Highly similar to INORGANIC PYROPHOSPHATASE [Bostaurus] 


4 


RC F04600 


ESTs 


4 


RC F08998 


ESTs 


2 


RC F09605 


ESTs 


4 


RC F11115 


ESTs 


3 


RC H06371 


ESTs 


1 


RC H 10995 


ESTs 


1 


RC H11938 


ESTs; Weakly similar to HYPOTHETICAL 97.6 KD PROTEIN IN 
SHP1-SEC17 INTERGENIC REGION [Saccharomyces cerevisiae] 


4 


RC H 16568 


ESTs 


4 


RC HI 6772 


ESTs 


1 


RC H18951 


ESTs; Moderately similar to seven-pass transmembrane receptor 
precursor [M.musculus] 


1 


RC H20859 


ESTs 


1 


RC H23747 


ESTs 


1 |rC H38087 


ESTs 


1 


RC H40331 


ESTs 


1 


RC H40567 


ESTs 
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Cluster 


Accession #/ 

PROBESET 


Gene Description 


1 


RC H46966 


ESTs 




RC H56640 i 


/q99a5.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 





RC H57154 


ESTs" Weakly similar to RST [M.musculus] 




^ 


RC H96712 


^§1! 


1 


RC N20814 






RC N25249 


^ i ociated rotein- 23kD ' 

synaptosoma -associa e pro em. . 


1 


RC N27100 




] 


RC N39616 


RNA^( uanine 7 meth transferase 

RNA (guanine- -)me y rans erase 




RC N48982 







RC N51957 


fstT" ' 




RC N52271 


Homo sapiens LIM protein mRNA; complete cds 




RC N59435 


ESTs: Weakly similar to No definition line found [H.sapiens] 




RC N64139 


ESTs; Weakly similar to Ndr protein kinase [H.sapiens] 




RC N56981 


ESTs 




RC N68640 


ESTs 




RC N69352 


ESTs; Highly similar to rKb-MKrNiA orLIOIINo rr\K> l \Jr\ r\Wr\ nci-ioMoc 
PRP22 [Saccharomyces cerevisiae] 




RC N95226 


Homo sapiens mRNA for KIAA758 protein; partial cds 


-■ 


RC R00138 


ESTs 




RC R07998 


ESTs; Weakly similar to III! ALU SUBFAMILY J WARNING ENTRY III! 




RC R08929 


ubiquitin-conjugating enzyme E2G 2 (homologous to yeast UBC7) 




RC R10307 


ESTs 




RC R33354 


ESTs 




RC R36083 


ESTs 




RC R37938 f 


ESTs 




RC R39330 


yd1g4 s1 Soares infant brain 1NIB Homo sspisns cDNA clon6 
IMAGE:24282 3', mRNA sequence 




RC R40816 s 


cullin 4A 




RC R43162 s 


ESTs 


3 


RC R45698 


ESTs' Weakly similar to Similarity to Salmonella regulatory protein UHPC 

[C.elegans] 


2 


RC R54554 


ESTs 


1 


RC R68425 


ESTs; Weakly similar to alternatively spliced product using exon 13A 
[H.sapiens] 


1 


RC R68568 


ESTs 


3 


RC R68763 


ESTs 


1 


RC R70467 


ESTs 


1 


RC R73565 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SX WARNING ENTRY 
!!!! [H.sapiens] 


4 


RC R73640 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 






RC R78376 






1 


RC R92453 


EST 






RC T03865 


ESTs 




3 


RC T03872 


ESTs 




1 


RC T10072 








RC T10080 


ESTs ' 




1 


RC T 10132 


Homo sapiens mRNA for KIAA478 protein; complete cds 






RC T 1 5343 








RC T23457 


ESTs 







RC T23555 


ESTs 




2 


RC T23670 


ESTs 




— 2 — 


RC T23948 






4 


RC T33464 


ESTs 




i 


RC T34413 


ESTs 







RC T3461 1 


ESTs 







RC T40920 


ESTs 




— 2 — 


RC T55182 


ESTs 




— - — 


RC T77453 


EST 




— ^ — 


RC T84039 


ESTs 


20 


— — 


RC T86458 


ESTs 




— — 


RC T87693 


ESTs 




2 


RC T89350 s 


ESTs 




i 


RC T90945 


ESTs 




2 


RC T90987 


ESTs 


25 


1 


RC_T91863 


ESTs 




1 


RC T91881 


EST 




1 


RC T93783 s 


ESTs 




1 


RC T96687 


ESTs 




2 


RC T96944 


ESTs 


30 




RC T97307 


ESTs; Weakly similar to neuronal thread protein AD7C-NTP [H. sapiens] 




1 


RC T97764 


ESTs 




2 


RC W48817 


ESTs 




2 


RC W58343 


ESTs 




1 


RC W59949 


ESTs; Highly similar to RAS-LiKE PROTEIN TC1 [Homo sapiens] 


35 


1 


RC W74644 


ESTs 






RC W74761 


ESTs; Highly similar to UBIQUITIN-CONJUGATING ENZYME E2-17 KD 
[Caenorhabditis elegans] 




^ 


RC W74802 


ESTs 




1 


RC W81205 


ESTs 




2 


RC W81237 


ESTs 


40 


3 


RC W90146 f 


ESTs 




1 


RC W92798 


ESTs 




1 


RC Z38412 


EST 




1 


RC Z38709 


inositol 1;4;5-triphosphate receptor; type 2 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 


1 


RC Z38904 


ESTs 


2 


RC Z39103 


core-binding factor runt domain; alplia subunit 2; translocated to; 2 


2 


RC Z39930 f 


ESTs 


2 


RC Z39939 


ESTs 


3 


RC Z40012 i 


Homo sapiens mRNA for KIAA587 protein; complete cds 


2 


RC Z40377 s 


ESTs 


] 


RC Z40820 






RC Z41680 


ESTs ' 





AFFX-BioB-3 






1 


RC AA005112 


Human zinc finqer domain containing protein mRNA- partial cds 

uman zinc- inger 1 . 


4 


RC AA005432 


ESTs; Highly similar to ANTI-SILENCING PROTEIN 1 (Saccharomyces 
cerevisiae] 


4 


RC AA010163 


Human mRNA for KIAA312 gene; partial cds 


4 


RC AA026356 


ESTs 




RC AA026901 


ESTs 


4 


RC AA036867 


ESTs 


1 


RC AA044644 


Pp52 


4 


RC_AA046426 


Homo sapiens MSE55-related protein (UB1) mRNA; complete cds 


4 


RC AA054515 


ESTs; Weakly similar to X-linked retinopathy protein {C-terminal; clone 
XEH.8c}[H.sapiens] 


2 


RC AA084162 


zn17h6.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens 
cDNA clone IMAGE:547739 3', mRNA sequence 


4 


RC_AA085749 


Homo sapiens mRNA for ATP binding protein; complete cds 


4 


RC AA098874 


ESTs 


2 


RC AA101056 


zn25b3.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens 
cDNA done IMAGE:548429 3' similar to contains L1 .b3 LI repetitive 
element ;, mRNA sequence 


1 


RC AA1 02746 


ESTs; Moderately similar to cytotoxic ligand TRAIL receptor [H.sapiens] 


2 


RC AA114250 S 


Homo sapiens mRNA for KIAA512 protein; complete cds 


4 


RC_AA1 26561 _s 


ESTs 


^ 


RC AA 128980 i 


ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 


4 


RC_AA1 29757 


ESTs; Highly similar to 6S RIBOSOMAL PROTEIN L22 [Rattus 

norvegicus] 


4 


RC AA1 29921 


ESTs 


2 


RC AA1 33331 


Homo sapiens mRNA for KIAA741 protein; complete cds 


2 


RC AA1 35958 


ESTs 


4 


RC_AA136524_s 


ESTs 


4 


RC AA 147044 


ESTs; Weakly similar to transformation-related protein [H.sapiens] 


4 


RC AA 148885 


ESTs 


4 


RC AA 150043 


ESTs 


2 


RC AA151621 


ESTs 


4 


RC AA1 55743 


ESTs 



84 



wo 01/11086 



PCT/USOO/22061 



10 



20 



25 



30 



Cluster 


Accession #/ 
PROBESET 


Gene Description 


2 


RC AA 156335 


ESTs 




RC AA1 56336 


Homo sapiens nuclea. receptor co-repressor N-CoR mRNA, complete cds 





RC AA159181 


— ! 





RC_AA 159825 


— 





RC AA234185 


— ! 




4 


RC AA234929 


ESTs 


1 


RC AA234935 


ESTs 


4 


RC AA236359 


ESTs 


2 


RC AA236466 


ESTs 


2 


RC AA236535 


ESTs 


4 


RC AA236935 S 


Human normal keratinocyte mRNA 


2 


RC AA236942 


ESTs 


4 


RC_AA237018 


ESTs 








2 


RC AA242751 


Homo sapiens mRNA for KIAA93 protein; partial cds 


3 


RC_AA242760 


ESTs 


3 


RC AA242763 


Homo sapiens Cdc14B1 ptiospiiatase mRNA; complete cds 


2 


RC_AA242809 


ESTs: Weakly similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 

[H.sapiens] 


? 


RC AA243133 


ESTs; Highly similar to SERINE/THREONINE-PROTEIN KINASE IPL1 
(Saccharomyces cerevisiae] 




RC AA243495 








RC AA243706 


§§I2 




4 


RC AA250848 


zs6e2.s1 NCI_GGAP_GCB1 Homo sapiens cDNA clone IMAGE:68441 3', 
mRNA sequence 


2 


RC„AA2 50868 


ESTs 


4 


RC_AA251 1 52 


ESTs 


2 


RC_AA251544_s 


ESTs 


4 


RC__AA251792 


ESTs 


4 


RC AA252063 


Homo sapiens mRNA for PCDH7 (BH-Pcdh)c; complete cds 


3 


RC AA252144 


ESTs 


4 


RC AA252524 


ESTs 


3 


RC AA253461 


ESTs 


4 


RC AA255522 


ESTs 


2 


RG_AA256468 


ESTs 




RC_AA2 56528 


ESTs 


2 


RC AA257976 


ESTs 


4 


RC__AA2 58296 


Homo sapiens mRNA for KIAA579 protein; partial cds 


3 


RC_AA258409 


H.sapiens gene from PAC 313L4; similar to Myelin PO 


2 


RC AA258421 


Homo sapiens clone 683 unknown mRNA; complete sequence 
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Accession #/ 

PROBESET 


Gene Description 


3 


RC_AA262077 


Human NAD+-dependent succinate-semlaldehyde deliydrogenase 
(SSADH) mRNA; 3' end 


4 


RC AA278650 


ESTs 


2 


RC AA278766 


ESTs 


4 


RC AA279667 S 


natural killer-tumor recognition sequence 


3 


RC AA280791 


eukaryotic translation initiation factor 5 


4 


RC_AA280819 


ESTs 


4 


RC AA280828 


ESTs 


4 


RC AA282195 


ESTs; Weakly similar to ORF YNL292w [S.cerevisiae] 


2 


RC AA283127 S 


Homo sapiens clone LM1955 H15e3 gene; partial cds 


2 


RC AA284694 


Homo sapiens CGI mRNA; complete cds 


3 


RC AA291 137 


^^^^ ~ ' ~, — : — ; 


^ 


RC AA291708 


ESTs; Moderately similar to liypotlietical protein [H. sapiens] 


3 


RC AA293495 


Homo sapiens BAC clone 255A7 from 8q21 containing NBS1 gene; 
complete sequence 


4 


RC AA347193 


ESTs; Weakly similar to NADH-UBIQUINONE OXIDOREDUCTASE 
CHAIN 4 [Caenorliabditis elegans] 


4 


RC AA398474 s 


ESTs 


4 


RC AA398512 


ESTs 


2 


RC AA400277 


ESTs; Weakly similar to putative p15 [H.sapiens] 


4 


RC AA400896 


ESTs 


3 


RC AA404494 


CTP synthase 


2 


RC AA4 10345 


ESTs; Weakly similar to A33 antigen precursor [H.sapiens] 


4 


RC AA416733 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 


4 


RC AA425154 


ESTs 


4 


RC AA426573 


ESTs 


2 


RC AA431418 


N-acetylglucosaminidase; alpha- (Sanfilippo disease IIIB) 




RC AA436182 


ESTs 


2 


RC_AA437099 


ESTs 


^ 


RC AA446585 





3 


RC AA446887 


ESTs 


2 


RC_AA447224 


ESTs; Highly similar to HYPOTHETICAL 8.7 KD PROTEIN IN 
ERG7-NMD2 INTERGENIC REGION [Saccharomyces cerevisiae] 


2 


RC_AA447709 


ESTs; Moderately similar to putative transcription factor CA1 5 [H.sapiens] 


4 


RC AA453624 


deoxynucleotidyltransferase; terminal 


4 


RC AA455044 


ESTs 




RC AA456045 


ESTs 


4 


RC AA460454 S 


ESTs; Weakly similar to KIAA512 protein [H.sapiens] 


4 


RC AA476494 


ESTs; Weakly similar to KIAA512 protein [H.sapiens] 


4 


RC AA476738 


ESTs; Highly similar to FLI-LRR associated proteln-1 [M musculus] 


4 


RC_AA481422 


Homo sapiens mRNA for H-2K binding factor-2; complete cds 


3 


RC AA482269 


Integral membrane protein 1 
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Accession #/ 
rKUbtot 1 




— I — 


RC AA482595 


ESTs- HThl'^sWIar to 36 ' 

s, ig ysimiar op 


— £ — 


RC_AA485084_s 


s 


4 


RC AA485431 s 






RC AA489057 


H sapiens mRNA for nuclear rotein SA 2 

H.sapiens m ornucearpro_ein _ — 





RC AA489638 


— ! 





RC AA491000 


— ! _ ■ 





RC AA491250 


! _— 


^ 


RC AA5051 33 




4 


RC AA598447 


Homo sapiens exportin t mRNA; complete cds 


3 


RC_AA599243 


ESTs 


3 


RC AA599574 i 


ESTs 


4 


RC AA600153 


DEK gene 


4 


RC AA609309 


ESTs 


4 


RC AA609710 


ESTs; Highly similar to HYPOTHETICAL GTP-BINDING PROTEIN IN 
PMI4-PAC2 INTERGENIC REGION [Saccharomyces cerevisiae] 


4 


RC_AA610068 


H.sapiens mRNA for PIBF1 protein; complete 


1 


RC AA621399 


ESTs 


1 


RC AA621752 


Human 26S proteasome-associated pad1 homolog (P0H1) mRNA; 
coniplete cds _ 




RC C21523 




^ 


RC D12160 


ESTs- Weakly similar to unknown [H sapiens] 


4 


RC D 19708 


ESTs 


2 


RC D25801 


ESTs; Higlily similar to KIAA445 protein [H.sapiens] 


2 


RC D45652 


ESTs; Weakly similar to unknown [H.sapiens] 


4 


RC D60208 f 


ESTs 


3 


RC D80504 s 


zinc finger protein 198 


2 


RC F03010 


ESTs; Weakly similar to ZINC FINGER PROTEIN HRX [Homo sapiens] 


4 


RC F04247 


ESTs; Weakly similar to !!!! ALU CLASS A WARNING ENTRY !!!! 
[H.sapiens] 


4 


RC F 10966 


ESTs; Weakly similar to 111! ALU SUBFAMILY J WARNING ENTRY III! 
[H.sapiens] 


4 


RC F 13700 


Homo sapiens ribonuclease P protein subunit p4 (RPP4) gene; complete 

cds 


4 


RC H05063 


ESTs 


4 


RC HI 6758 


ESTs' Highly similar to ERYTHROPOIETIN RECEPTOR PRECURSOR 
[Homo sapiens] 


4 


RC HI 731 5 s 


karyopherin alpha 1 (importin alpha 5) 


4 


RC H22556 


PROTEIN TRANSLATION FACTOR SUM HOMOLOG 


4 


RC H22566 


ESTs; Highly similar to protein tyrosine phosphatase epsilon cytoplasmic 

isoform [H.sapiens] 


4 


RC H48459 s 


Human mRNA for KIAA186 gene; complete cds 


4 


RC H 53073 


ESTs 


2 


RC_H56559_s 


Homo sapiens mRNA for KIAA61 protein; partial cds 


3 


RC H57957 s 


ESTs 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 




2 


RC H64938 s 


ESTs 




2 


RC H 64973 


ESTs 




4 


RC H 69535 


ESTs 




2 


RC H73110 


[H. sapiens] 


5 


2 


RC H81783 


ESTs 




1 


RC H 86259 


Homo sapiens chromosome 19; cosmid R32611 




2 


RC_H 88353 


ESTs; Weakly similar to reverse transcriptase related protein [H.sapiens] 




2 


RC H 88639 


ESTs 




4 


RC H88675 


ESTs 


10 


4 


RC H93708 s 


CLEAVAGE SIGNAL-1 PROTEIN 






RC_N22107 


ESTs; Weakly similar to llll ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 




3 


RC N 24046 


ESTs; Weakly similar to 68 RIBOSOMAL PROTEIN LI [Homo sapiens] 




2 


RC N27028 


ESTs 




2 


RC N30205 


ESTs; Weakly similar to hypothetical protein [H.sapiens] 


15 


1 


RC N30621 


ESTs 




^ 


RC N33258 


Homo sapiens nuclear receptor co-repressor N-CoR mRNA; complete cds 




2 


RC N 33390 


EST 




2 


RC N40180 


EST; Weakly similar to putative p15 [H.sapiens] 






RC N45198 


EST 


20 




RC N45979 s 


SH3 domain protein 1B 






RC N48325 


EST 






RC N48913 


ESTs 




^ 


RC N49394 


Homo sapiens mRNA for KIAA716 protein; complete cds 




1 


RC N50656 


ESTs; Highly similar to mosaic protein LR1 1 [H.sapiens] 


25 


4 


RC N50721 


kinesin family protein 3B 




4 


RC N53143 


ESTs 




2 


RC N53359 


ESTs 




4 


RC N55326 


ESTs 




2 


RC N55493 


yv5c2.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
IMAGE:246146 3', mRNA sequence 


3 0 




RC N57493 


EST 






RC N62955 


ESTs; Weakly similar to ankyrin G [H.sapiens] 




4 


RC N63520 


EST; Weakly similar to mariner transposase [H.sapiens] 




4 


RC N63604 


ESTs 




2 


RC N64166 


frizzled (Drosophila) homolog 7 


35 


2 


RC N64168 


ESTs 




2 


RC N64191 


ESTs 




4 


RC_N66845 


ESTs; Weakly similar to !!!! ALU CLASS B WARNING ENTRY !!!! 
'H.sapiens] 




4 


RC N67135 


ESTs 
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Cluster 


Accession #/ 
PROBESET 


Gene Descri tion 







RC N67295 


ESTs 







RC N68399 


H2B Nstone famil ■ member N 






RC N 68963 


ESTs ^ 







RC N69331 


e tkj 1 rol 1 isomerase C (c do hilin C) 







RC N70777 


ESTs ^ ^ '^""^^'^^^^ °^ ' '" 






RC N71364 s 


ESTs WeakI similar to transformation-related rotein [H sa iens] 







RC N71545 s 


ESTs- Moderately similar to hypothetical protein [H sapiens]'' 







RC N71571 


ESTs 






RC N74456 


EST 


10 




RC N75594 









RC N79035 


EST 






RC N80279 


ESTs; Highly similar to (dcfline not available 4239677) [H. sapiens] 






RC N91797 






4 


RC_^N92454 


karyopherin (importin) beta 1 


15 


4 


RC_N94581 


actin; beta 




4 


RC_N94746 


ESTs 




4 


RC_N98238 


ESTs 




4 


RC^R02384 


EST 






RC R 16833 


ESTs; Weakly similar to IIM ALU CLASS F WARNING ENTRY III! 
[H. sapiens] 


2 0 




RC R41828 s 









RC R43203 


ESTs 






RC R46395 






2 


RC R58863 


ESTs 




2 


RC R78248 


ESTs 


25 


4 


RC T11483 


ESTs 






RC_T 16896 


ESTs 




2 


RC T23820 


cyclln T2 






RC T30222 


ESTs WeakI similar to tetrac dine trans orter-like rotein [M musculus] 






RC W15275 S 


ESTs ~ 


30 




RC W38194 


Accession not listed in Genbank 

__! EJllJ — en_an 







RC W42414 s 






4 


RC W46577_s 


H. sapiens mRNA for ESM-1 protein 




4 


RC W49632 s 


Human clone 2398 mRNA sequence 






RC W57613 


ESTs 


35 


2 


RC W57759 


EST 




4 


RC W61118 


ESTs 




4 


RC W65344 


ESTs; Highly similar to ICH-2 PROTEASE PRECURSOR [Homo sapiens] 




2 


RC W69216 


ESTs 




2 


RC W69379 


ESTs; Weakly similar to mitochondrial inner membrane protease 1 
IS.cerevisiae] 


40 


4 


RC W86728 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 


4 


RC Z38499 


ESTs; Weakly similar to protein phosphatase [H .sapiens] 


^ 


RC Z38630 


Homo sapiens IkD protein (BC1) mRNAj complete cds 




RC Z39494 


■ 





RC Z39623 


— ^ : — Z 





RC Z40071 s 


BMX non-r6C6ptor tyrosine kin3se 





RC Z40174 


— 




? 


RC Z40182 






RC Z40904 


is 






1 


AFFX-BioB-3 






AFFX-BioC-3 






3 


AFFX-DapX-5 






AFFX-LysX-M 







RC AA1 66965 




^ 


RC AA1 67500 


is? ~~ 


1 


RC AA1 69599 s 


ESTs 




RC AA171724 


ESTs 


2 


RC AA171739 


ESTs 


3 


RC AA177105 


ESTs 


2 


RC AA1 82626 


ESTs 


3 


RC AA1 86324 


ESTs; Highly similar to cell cycle progression restoration 8 protein 
[H. sapiens] 


1 


RC AA1 92099 


zinc finger protein 148 (pHZ-52) 


3 


RC AA192173 


ESTs; Moderately similar to III! ALU SUBFAMILY SO WARNING ENTRY 

!!!! [H.sapiens] 


3 


RC AA192415 


EST 


3 


RC AA1 92553 


ESTs; Moderately similar to RGC-32 [R.norvegicus] 


3 


RC AA 194851 


ESTs 


3 


RC AA1 95520 S 


ESTs 




RC AA 196300 


ESTs; Moderately similar to ill! ALU SUBFAMILY SQ WARNING ENTRY 
[H sapiens] 


3 


RC AA196517 


Lon protease-like protein 


3 


RC AA 196549 


ESTs 




RC AA 196721 


zq9a3.s1 Stratagene muscle 93729 Homo sapiens cDNA clone 
IMAGE:629164 3' similar to TR:G746415 G746415 1 KAPPA BR. ;, mRNA 
sequence 




3 


RC AA 196729 i 


ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 


1 


RC AA 196979 


ESTs; Moderately similar to RETROVIRUS-RELATED PROTEASE 
[H.sapiens] 


2 


RC AA206828 


ESTs; Weakly similar to ubiquitous TPR motif; Y isoform [H.sapiens] 


3 


RC AA207123 


immunoglobulin superfamily; member 3 


1 


RC AA2 14539 i 


ESTs 


3 


RC AA226914 S 


TR2 nuclear hormone receptor 
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15 



30 



35 



Cluster 


Accession #/ 
PROBESET 


Gene Description 


3 


RC_AA227260 


Zic family member 3 (odd-paired Drosophila homolog; heterotaxy 1) 


3 


RC AA227469 


EST 


3 


RC AA233122 


ESTs; Highly similar to CALCIUM/CALMODULIN-DEPENDENT 
PROTEIN KINASE TYPE II DELTA CHAIN [Rattus norvegicus] 


3 


RC AA233334 s 


Homo sapiens josephin MJD1 mRNA; cds 


3 


RC AA233347 


Homo sapiens zinc finger protein 216 splice variant 2 (ZNF216) mRNA; 

complete cds 


1 


RC_AA233519 


ESTs; Weakly similar to neuronal thread protein AD7C-NTP [H.sapiens] 


1 


RC AA233714 


Apg12 (autophagy; yeast) homolog 


1 


RC AA233796 


ESTs 


1 


RC AA235050 f 


ESTs 




RC AA235704 


ESTs; Weakly similar to Wiscott-Aldrlch Syndrome protein homolog 
[M.musculus] 




^ 


RC AA236031 


^21® 


] 


RC AA236352 


^§12 




RC AA236390 S 


^STs 





RC_AA2 36453 








RC AA243370 


— 





RC AA250947 


Ll! 




! 


RC AA251083 


121! 


2 


RC AA2511 13 


121! : 




RC AA251973 


^^^^ — : : : 




^ 


RC AA252023 


ESTs; Moderately similar to (defline not available 397874) [H.sapiens] 




RC AA252414 


^^^^ — : : : : : : 


1 


RC AA252650 


protein kinase; mitogen-activsted; kinase 7 (MAP kinase kinase 7) 


3 


RC AA255523 


ESTs 


3 


RC AA258128 


ESTs 


3 


RC AA262105 


Human mRNA for KIAA331 gene; complete cds 


1 


RC AA262107 


ESTs 


1 


RC AA262235 


ESTs 




RC_AA278298 


zs8b3.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:73757 3', 
mRNA sequence 





RC AA278529 i 


ESTs; Highly similar to serine/threonine protein kinase [H.sapiens] 


1 


RC AA278721 




1 


RC AA280036 


ESTs 


1 


RC AA280648 


ESTs; Weakly similar to rab-related GTP-binding protein [H.sapiens] 


1 


RC AA280738 


ESTs 


3 


RC_AA280794 


ESTs 


1 


RC AA280837 


ESTs 


1 


RC AA280886 


ESTs; Moderately similar to alternatively spliced product using exon 13A 
[H.sapiens] 


1 


RC AA280934 


ESTs 


1 


RC AA281535 


Homo sapiens mRNA for KIAA879 protein; complete cds 
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Cluster 


Accession #/ 

PROBESET 


Gene Description 


4 


RC AA281797 s 


Homo sapiens basic transcription factor 2 p44 (btf2p44) gene; partial cds; 
neuronal apoptosis inhibitory protein (naip) and sui^ival motor neuron 
protein (smn) genes; complete cds 


1 


RC AA282047 


ESTs 


1 


RC AA283002 


Human zinc flnger protein (SRE-ZBP) mRNA; 3' end 


3 


RC AA283709 


ESTs 


1 


RC_AA283g02 


ESTs; Weakly similar to X-linked retinopathy protein {C-terminal; clone 
XEH.Bc} [H .sapiens] 


1 


RC^AA284108 


Human DNA from chromosome 19-speciflc cosmid F25965; genomic 
sequence 


1 


RC AA284109 


Human DNA sequence from clone 71 L16 on chromosome Xpl 1 . Contains 
a probable Zinc Finger protein (pseudo)gene; an unknown putative gene; 




KL» AAZo40/ 1 


HomTsa ^16115^011^2396^ ^ 

Homo sapiens cone un nown m ■ ^ 





RC AA284744 f 






RC AA284784 







RC AA284840 


EST^ 




RC AA286844 


ESTs 




RC AA287032 






RC AA287038 


ES? 




RC AA287546 


isTs 




RC AA287553 s 


isTs 




KO AMZO / ODD 


ESTs; Weakly similar to !!!! ALU CLASS B WARNING ENTRY !!!! 
[H.S3pi6ns] 




RC AA287564 


hbosomsl prot6in L37 





RC AA291015 S 


CDC7 (c6ll division cyci6 7\ S. C6r6visi36| homoio9)~iik6 1 





RC AA291716 






RC AA291749 S 


ESTs 


1 


RC AA293656 


EST 


1 


RC AA302430 


ESTs 




RC AA302809 


EST 


1 


RC AA302820 S 


purlnergic receptor P2X; llgand-gated Ion channel; 4 


1 


RC AA3 10499 


ESTs 


^ 


RC AA321890 


EST24442 Cerebellum II Homo sapiens cDNA 3' end, mRNA sequence 




RC^AA340589 







RC AA340622 


EST 

! 





RC AA342457 i 


! . 


■ 


RC AA342828 s 


glycoprotein V (pi3tei6t) 




RC AA342864 






RC AA342973 


ESTs 




RC_AA346495 


ESTs 




RC_AA347573 


Homo sapiens KIAA45 mRNA; complete cds 




RC AA347614 


ESTs 




RC AA347717 


ESTs 
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10 



20 



30 



Cluster 


Accession #/ 
PROBESET 


Gene Description 


1 


RC AA348913 


ESTs 


1 


RC AA349647 


EST 




RC_AA349773 


ESTs 


1 


RC AA350541 s 


ESTs; Moderately similar to alternatively spliced product using exon 1 3A 

[H. sapiens] 


1 


RC AA357159 i 


EST 




RC AA357172 i 


ESTs 




RC AA369856 s 


Human hVs41 fhVPS41) mRNA- alternative s lice variant- artial cds 




RC AA370132 


_^^ian — p p] )_m . a erna ive sp ice vanan , pa la c s — 





RC AA370472 S 


isTs 





RC AA370867 


isTs 




RC AA377296 


ESTs 




RC AA383902 


ESTs 




RC_AA385934 


EST; Highly similar to predicted using Genefinder [C.elegans] 




RC AA386255 


EST 




RC AA386260 


EST 




RC AA386266 


ESTs; Highly similar to MEMBRANE GLYCOPROTEIN M6-B [Mus 


■ 


RC AA398014 


ESTs"'"^^ 




RC AA398222 


isTs 


■ 


RC AA398235 


isTs 




RC AA398348 


ESTs 




RC AA398482 






RC AA398504 


E^ 





RC_AA398505 


isTs 


^ 


RC_AA398507 


isTs 

! 




RC AA398523 


ESTs; Weakly similar to !!!! ALU SUBFAMILY SQ WARNING ENTRY !!!! 
[H.S3pi6ns] 




RC AA398625 







RC AA398632 


ESTs 




RC AA398633 


ESTs 




RC AA398894 


ESTs 




RC AA398895 


EST 




RC AA398900 


ESTs 




RC AA398904 


EST 




RC AA399122 


ESTs Weakly similar to mitochondrial citrate transport protein [H sapiens] 




RC AA399371 


ESTs 




RC_AA399373 


ESTs; Highly similar to KIAA568 protein [H.sapiens] 




RC_^AA399441 


ESTs 




RC^AA399636 


ESTs 




RC AA399640 


ESTs 




RC AA399680 


ESTs 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 




3 


RC AA400080 


EST 




1 


RC AA400262 


ESTs 




1 


RC_AA400725 


ESTs 






RC AA400748 


ESTs 


5 


1 


RC_AA400780 


ESTs 






RC AA401631 


zv65b9.s1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA clone 
IMAGE:758489 3', mRNA sequence 




"I 


RC AA401688 


ESTs 




"I 


RC_AA401695 


EST 






RC AA402227 


ESTs; Moderately similar to N-tropomodulin [R.norvegicus] 


10 




RC AA402329 


ESTs 




"I 


RC..AA402398 


ESTs 






RC^AA402449 


EST 






RC_AA402468 


ESTs 






RC AA403268 S 


ESTs 


15 




RC_AA403314 


ESTs 






RC AA404229 


EST 




1 


RC_AA404260 


ESTs 




1 


RC_AA404271 


Human glutamate/kainate receptor subunit (EEA3) mRNA; complete cds 






RC_AA405026 


ESTs 


20 


1 


RC AA405182 


ESTs 




„.„J 


RC_AA405237 


ESTs; Moderately similar to alternatively spliced product using exon 13A 
[H.sapiens] 






RC_AA406061 


EST 






RC AA406063 


ESTs 






RC AA406070 


EST 


25 




RC_AA406137 


EST 






RC AA406335 


ESTs 






RC AA411801 


IHuman mRNA for KIAA37 gene; complete cds 






RC_AA411804 


ESTs 






RC_AA411833 


ESTs; Highly similar to (defline not available 4521278) [H.sapiens] 


30 


"I 


RC AA412219 


ESTs 






RC AA412259 


ESTs 






RC_AA4 12497 


Human Llne-1 repeat mRNA with 2 open reading frames 






RC AA4 12498 


ESTs 




1 


RC_AA416586 


ESTs 


35 


^ 


RC AA4 16867 


^§1 




^ 


RC AA4 16874 


|STs 




^ 


RC AA421133 








RC AA421138 


ES? 






RC AA422079 


ESTs; Highly similar to ELONGATION FACTOR G; MITOCHONDRIAL 
PRECURSOR [Rattus norvegicus] 


40 




RC_AA423837 


ESTs 






RC_AA424328 


ESTs 






RC AA424339 


ESTs 
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Cluster 


PROBESET 


Gene Description 




3 


RC AA424469 s 


ESTs 




1 


RC AA424502 


ESTs 




3 


RC AA425004 


ESTs 




^ ... 


RC AA425734 


ESTs; Weakly similar to neuronal thread protein ADTc-NTP [H. sapiens] 


5 


'' 


RC AA425887 


ESTs 






RC_AA426456 


ESTs 






RC AA427396 


ESTs 




'' 


RC AA427555 


Human mRNA for KIAA23 gene; complete cds 






RC AA428218 


ESTs 


10 




RC AA428242 








RC AA428281 


EST 






RC AA428865 


EST 






RC AA428994 


ESTs 




1 


RC_AA429666 


ESTs 


15 




RC AA430181 


ESTs 




^ 


RC AA430184 s 


Human putative ATP/GTP-binding protein (HEAB) mRNA; complete cds 






RC AA431288 s 


CD3D antigen; delta polypeptide (TiT3 complex) 




^ 


RC AA431293 


ESTs 






RC AA431478 


ESTs 


20 




RC AA431492 


EST 




^. 


RC AA431732 


EST 






RC AA432278 


EST 




i 


RC_AA43441 1 


ESTs 






RC AA435512 J 


ESTs 


25 




RC_AA435698 


ESTs 




^ 


RC AA435711 


Homo sapiens mRNA for KIAA712 protein; complete cds 




3 


RC AA435815 S 


Clk-associating RS-cyclophllln 




3 


RC_AA435842 


ESTs 




3 


RC AA436475 


ESTs 


30 


3 


RC AA436489 


ESTs 




3 


RC_AA442060 


ESTs 




1 


RC AA442079 


EST 




3 


RC AA443151 


ESTs 






RC AA446133 


ESTs 


35 




RC AA447145 


Homo sapiens KIAA399 mRNA. partial cds 




3 


RC AA447398 









RC AA447643 


ESTs 




^ 


RC AA447742 s 


dynein; axonemal; heavy polypeptide 17-like 




3 


RC AA448226 


zw96c1.s1 Scares total fetus Nb2HF8 9w Homo sapiens cDNA clone 
IMAGE:7B4818 3', mRNA sequence 


40 


1 


RC AA448825 


EST 






RC AA449444 


ESTs 




3 


RC AA450087 


requlator of Gz-selective protein signaling 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 




3 


RC_AA45021 1 


EST 




1 


RC AA450244 


ESTs 




3 


RC AA452123 


ESTs; Wealtiy simliar to Tcp-1 [iVI.musculus] 




3 


RC AA452155 


zinc finger protein 198 


5 


3 


RC AA452156 


EST 




3 


RC AA453036 


ESTs 




3 


RC AA453526 


ESTs 




3 


RC AA454085 


EST 




3 


RC AA454103 


ESTs 


10 


1 


RC AA454642 


ESTs 




1 


RC AA454935 


ESTs 




3 


RC AA456323 


ESTs 




3 


RC_AA457395 


EST 






RC AA458850 


aa26c7.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IIVIAGE:81438 




3 


RC AA459662 


3^s|milar to contains L1.t3 L1 repetitive element ., mRNA sequence 




3 


RC_AA459668 


Homo sapiens 3-liydroxyisobutyryl-coenzyme A liydrolase mRNA; 

complete cds 




1 


RC^AA459679^s 


ESTs; Weakly similar to The KIAA191 gene is expressed ubiquitously. 
[H. sapiens] 




1 


RC_AA459702 


ESTs 




4 


RC_AA460017_f 


ESTs 


20 


3 


RC AA460324 


ESTs 




3 


RC AA461509 


ESTs; Weakly similar to liypotlietical protein II [H.sapiens] 




3 


RC_AA464414__i 


ESTs 




1 


RC AA464428 


ESTs 




3 


RC AA470084 


ESTs 


25 


3 


RC AA476606 s 


ESTs 




3 


RC AA478521 


ESTs 




3 


RC AA478523 


ESTs; Moderately similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens) 




3 


RC_AA479949 


ESTs 




3 


RC_AA481252 


RAS-LIKE PROTEIN TC21 


30 


1 


RC_AA485351 


ESTs; Wealtly similar to predicted using Genefinder [C.elegans] 




1 


RC_AA487264 


ESTs 




1 


RC_AA489072 


Homo sapiens mRNA for KIAA87 protein; complete cds 




1 


RC_AA489630 


Homo sapiens mRNA for KIAA665 protein; complete cds 




2 


RC AA490225 


ESTs 


35 


3 


RC AA490227 


ESTs 




3 


RC AA490255 


ESTs 




1 


RC_AA490890 


ESTs 




2 


RC_AA490916_s 


ESTs 




3 


RC AA490925 


Homo sapiens laforin (EPM2A) mRNA; partial cds 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 




1 


RC AA490955 


ESTs; Weakly similar to bullous pemphigoid antigen [IVl.musculus] 




1 


RC AA495812 


ESTs 




3 


RC AA495824 


ESTs 




1 


RC AA496369 


ESTs 


5 


3 


RC AA504125 s 


ESTs 






RC AA521473 


Human brain secretory protein hSeclp (HSEC1 ) mRNA; complete cds 






1 


RC AA598440 







3 


RC AA598899 i 


.^STs _ ^ 




3 


RC AA599244 


Homo sapiens mRNA for KIAA53 protein; partial cds 


10 


1 


RC AA599694 s 


Human mRNA for KIAA133 gene; complete cds 




1 


RC AA600037 


^^Ts _ 




3 


RC AA609135 


^21 




1 


RC_AA609582 


Homo sapiens p6 katanin mRNA; complete cds 




3 


RC AA609684 


^^l! 


15 


3 


RC_AA609839 


^STs 






RC AA609862 


Homo sapiens mRNA for RBP-MS/type 3; complete cds 






4 


RC AA620423 


— 




^ 


RC AA620747 






1 


RC AA621364 


— 




20 


2 


RC C20653 







3 


RC D20085 







1 


RC D20749 


ISTs 




2 


RC D51285 s 







4 


RC D59972 i 





25 


^ 


RC F04112 f 


HE! 




2 


RC F 13604 







^ 


RC HOI 662 







1 


RC H05135 i 


_ . 




3 


RC HI 2245 


splicing factor; arginine/serine-rich 7 (35kD) 


30 


1 


RC H22842 


EST 




1 


RC_H30894 


. 




2 


RC H43442 S 


Human mRNA for KIAA28 gene; partial cds 




3 


RC H45996 


ESTs 




2 


RC H69281 i 


HE! 


35 


3 


RC H69485 f 


^^^2 _ ; 




1 


RC H69899 


ESTs; Moderately similar to unknown [H.sapiens] 




4 


RC H70627 s 







1 


RC H73050 s 


Rhesus blood group; D antigen 






RC H73260 




40 


1 


RC H77531 s 


HIR (histone cell cycle regulation defective; S. cerevisiae) homolog A 




2 


RC H80552 


EST 




4 


RC_H80737_s 


lysyl oxidase 




1 


RC_H93412 


ESTs 




3 


RC H94892 s 


v-ral simian leukemia viral oncogene homolog A (ras related) 
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PROBESET 


Gene Description 


4 


RC_H95643_s 


neurotrophic tyrosine kinase; receptor; type 1 


2 


RC H96552 


ESTs 


1 


RC H97146 


ESTs; Highly similar to G protein-coupled receptor kinase 6; splice variant 

B [H.sapiens] 


^ 


RC H99131 s 


■^^-'^ : : : 


1 


RC H99462 s 


ribosomal protein; mitochondrial; L12 


1 


RC H99837 s 


^515 


2 


RC N22140 


ESTs; Highly similar to TUBULIN GAMMA CHAIN [Euplotes 

octocarinatus] 


2 


RC N22197 


ESTs 




RC__N23756_s 


Human mRNA for KIAA238 gene; partial cds 


2 


RC N24134 


eukaryotic translation Initiation factor 1A; Y chromosome 


4 


RC N24195 


Homo sapiens mRNA for RanBPM; complete cds 


1 


RC N26739 


CAAX box 1 


2 


RC N27098 


EST 


1 


RC N27637 


ESTs 


4 


RC N33090 


ESTs; Weakly similar to translation initiation factor [H.sapiens] 




RC N35967 


ESTs 


1 


RC N38959 f 


Homo sapiens chaperonin containing t-complex polypeptide 1; beta 
subunit (Cctb) mRNA; complete cds 


2 


RC N39069 


ESTs 


1 


RC_N46441 


ESTs 


2 


RC N48270 f 


ESTs 


2 


RC N48365 s 


ESTs 


2 


RC N51316 


ESTs 


1 


RC N51499 s 


ESTs 


4 


RC N53976 


ESTs 


2 


RC N54157 


ESTs 


2 


RC N54300 


ESTs 


1 


RC^N54831 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 


2 


RC_N59849 


ESTs 


4 


RC_N62132 


ESTs 


1 


RC N62375 


EST 


4 


RC N63138 


ESTs 


1 


RC N63172 


cell division cycle 42 (GTP-binding protein; 25kD) 




RC_N63772 


1q32.3.-4'l. Contains the HSD11B1 gene for Hydroxysteroid (11-beta) 
Dehydrogenase 1 ; the AD0RA2BP adenosine A2b receptor LIKE 
pseudogene; the IRF 


2 


RC N63787 


ESTs 


2 


RC_N68168 


za11c1.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
IIV1AGE:292224 3', mRNA sequence 


2 


RC N68201 


ESTs; Weakly similar to hypothetical protein [H.sapiens] 


2 


RC N68300 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 






RC_N68321 


solute carrier family 2 (facilitated glucose transporter); member 3 






RC N69575 


EST 






RC N75007 


ESTs 




1 


RC N75542 


ESTs 


5 




RC N90066 


Homo sapiens clone 24689 mRNA sequence 




1 


RC N91246 


ESTs 






RC N92751 


ESTs; Weal<ly similar to cyclic nucleotide-gated channel beta subunit 
[R.norvegicus] 






RC_N93214_s 


ESTs 






RC N99148 


ESTs; Highly similar to MKR2 PROTEIN [Mus musculus] 


10 




RC R07876 


ESTs; Weakly similar to HYPOTHETICAL PROTEIN H1 1723 
[Haemophilus influenzae] 







RC R10865 f 


alpha-fetoprotein 







RC R11056 


ESTs 






RC R11488 


ESTs 






RC R22947 


ESTs 


15 





RC_R23g30_S 


ESTs 






RC R26589 f 


ESTs 







RC R37588 S 


GDS-related protein 







RC R37613 


ESTs 




'' - 


RC R38398 


Homo sapiens clone 23758 mRNA sequence 


20 




RC R39179 f 


ESTs 




3 


RC R40923 


ESTs 






RC R41 179 


Human mRNA for KIAA328 gene; partial cds 




2 


RC R41294 s 


ESTs 






RC R42307 f 


early development regulator 2 (homolog of polyhomeotic 2) 


25 




RC R43189 f 


EST 






RC R43306 


ESTs 




1 


RC R44357 


ESTs 






RC R44519 


EST- Moderatel similar to Pro Pol dUTPase ol rotein fM musculus! 
0 eraeysimiar o ro- o- ase po ypro em [ .muscuus] 






RC R45088 


yg38g4.s1 Soares infant brain 1NIB Homo sapiens cDNA clone 
IMAGE:34896 3", mRNA sequence 


30 




RC R47948 i 


ESTs 






RC R51524 


ESTs 






^- 


RC R54950 


ESTs 




^ 


RC R55241 


EST 






RC R59585 


ESTs 


35 





RC_R60044 


ESTs 






RC R60872 


ESTs 






RC R66690 


ESTs 






RC R67266 s 


exostoses (multiple)-like 1 






RC_R73588 


ESTs 


40 


3 


RC R79403 


ESTs 
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Accession #/ 
PROBESET 


Gene Description 


1 


RC_R87647 


ESTs 




RC R93622 


ESTs 


4 


RC_R99599_s 


heterogeneous nuclear ribonucleoprotein U (scaffold attachment factor A) 


f 




ESTs 


1 


RC_T02888 


FB14D6 Fetal brain, Stratagene Homo sapiens cDNA clone FB14D6 
3'end, mRNA sequence 


1 


RC T03170 


EST 


2 


RC T10465 


hbc313 Human pancreatic islet Homo sapiens cDNA clone hbc313 3'end, 
mRNA sequence 


1 


RC T15418 f 


ESTs 


1 


RC T15597 f 


Homo sapiens mRNA for KIAA661 protein; complete cds 


2 


RC T15652 i 


ESTs 


2 


RC T16898 s 


ash2 (absent; small; or homeotic; Drosophila; homolog)-like 




RC T26644 i 


ESTs; Weakly similar to zinc finger protein ZNF1 39 [H.sapiens] 


2 


RC_T40841 


ESTs 


^ 


RC_T47566_i 


yb15c1 l.sl Stratagene placenta (#937225) Homo sapiens cDNA clone 
IMAGE:71252 3' similar to similar to gb:Z2157 ELONGATION FACTOR 
1 -DELTA (HUMAN), mRNA sequence 


2 


RC_T50116 


ESTs; Moderately similar to EA22 GENE PROTEIN [Bacteriophage 
lambda) 


2 


RC T50145 s 


FSHD region gene 1 




RC T58615 


ESTs; Moderately similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! 
[H.sapiens] 


^ 


RC T59940 f 




4 


RC_T63595 


ESTs 


2 


RC T64891 


yd1c2.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
IMAGE:66722 3", mRNA sequence 


2 


RC T64924 


ESTs 




RC T64933 r 


ESTs, Weakly similar to hypothetical protein [H.sapiens] ^ 


2 


RC_T68875 


yc3f5.s1 Stratagene liver (#937224) Homo sapiens cDNA clone 
IMAGE:8229 3', mRNA sequence 


2 


RC T69027 


ESTs 


3 


RC T69924 


yc19d3.s1 Stratagene lung (#93721) Homo sapiens cDNA clone 
IMAGE:81 125 3", mRNA sequence 


3 


RC T70353 


ESTs 


1 


RC_T79780_S 


ESTs; Weakly similar to PUTATIVE MITOCHONDRIAL CARRIER 
YBR291C [Saccharomyces cerevisiae] 


2 


RC__T79951 


ESTs 


3 


RC__T80174_s 


ESTs 


3 


RC T80622 


ESTs; Weakly similar to envelope protein RIC-7 [H.sapiens] 


1 


RC_T85352 


ESTs 


1 


RC T85373 


ESTs 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 


2 


RC T86284 


ESTs; Weakly similar to transformation-related protein [H.sapiens] 


1 


RC T89579 s 


Homo sapiens E2F-related transcription factor (DP-1) mRNA; complete 

cds 


3 


RC T90360 


ESTs 


2 


RC T94328 i 


ESTs 


1 


RC T95590 


ye4a3.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
IMAGE:12172 3' similar to gb|M1817|IGURRAA Iguana iguana 58 
(rRNA);, mRNA sequence 


4 


RC T97257 f 


ESTs; Weakly similar to tiypotfietical protein [H.sapiens] 


2 


RC T97599 i 


ESTs 


2 


RC T97620 


ESTs; Weakly similar to unknown [H.sapiens] 


4 


RC T97775 


ESTs 


3 


RC T98152 


fibrillin 2 


1 


RC W31479 


ESTs 




RC W37999 


ESTs 


2 


RC W38240 


Accession not listed in Genbank 


2 


RC W40150 


tiuman chromosome-associated polypeptide (bamacan) 


2 


RC W45435 


Homo sapiens mRNA for KIAA784 protein; partial cds 


2 


RC W58202 


ESTs 


1 


RC W58344 


ESTs 


2 


RC W58650 


ESTs 


4 


RC W6a736 


Human DNA sequence from clone 1 1 89B24 on ctiromosome Xq25-26.3. 
Contains NADH-Ubiquinone Oxidoreductase MLRQ subunit (EC 1.6.5.3; 
EC 1.6.99.3; CI-MLRQ); Tubulin Beta and Proto-oncogene 
Tyrosine-protein 


2 


RC W69106 


ESTs 


2 


RC W69111 


ESTs 


1 


RC W69385 s 


H.sapiens NuMA gene (Clone T33) 


3 


RC W69399 s 


ATPase; Ca++ transporting; plasma membrane 1 


3 


RC W69459 


ESTs 


2 


RC W72424 


SI calcium-binding protein A9 (calgranulin B) 


2 


RC W72724 


ESTs 


2 


RC W72834 


ESTs 


1 


RC W73955 


Homo sapiens chromosome 19; cosmid R26445 


2 


RC W74701 


ESTs 


2 


RC_W76540 


ESTs 


2 


RC W79397 


ESTs 




RC W85888 


ESTs; Weakly similar to synapse associated protein sap47-2 
[D.melanogaster] 


2 


RC W85038 


ESTs 


2 


RC W86881 


ESTs 


2 


RC W87804 


ESTs 


2 


RC W88942 


zh7b5.s1 Soares_fetal_liver_spleen_1NFLS_S1 Homo sapiens cDNA 
clone IMAGE:417393 3', mRNA sequence 
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Cluster 


Accession #/ 
PROBESET 


Gene Description 


3 


RC W90022 


ESTs; Highly similar to LECT2 precursor [H.sapiens] 


2 


RC W92272 


Homo sapiens zinc-finger helicase (hZFH) mRNA; complete cds 


2 


RC W92764 s 


TUMOR NECROSIS FAGTOR-INDUCIBLE PROTEIN TSG-6 
PRECURSOR 


2 


RC W93040 


ESTs 


3 


RC W93092 


neutral sphingomyelinase (N-SMase) activation associated factor 


2 


RC W93227 


EST 


2 


RC W93523 


ESTs 


2 


RC W93659 |eSTs 


2 


RC W94003 s 


ESTs 


2 


RC W94401 s 




2 


RC W94688 


Homo a iens mRNA for erill in- com letecds 

Homo sapiens m or pen ipin, comp e e c s 


2 


RC W94787 S 




2 


RC Z38294 s 


isTs 


3 


RC Z3831 1 


isTs 

— ! 


^ 


RC Z38465 s 






RC Z38525 S 


EsS ' 





RC Z38538 f 


ESTs 





RC Z38551 S 


ESTs 




2 


RC Z38783 s 


ca2+-dependent activator protein for secretion; Ca2+-regulated 
cytosl<eletal protein (CAPS) 




RC Z39113 


ESTs 


4 


RC Z39255 f 


ESTs 




RC Z39591 


EST 


2 


RC Z39783 s 


ESTs 


2 


RC Z39920 


ESTs; Highly similar to NADH-CYTOCHROME B5 REDUCTASE [Bos 
taurus] 


2 


RC Z40166 f 


ESTs 


3 


RC Z40388 s 


ESTs 


2 


RC Z40646 


ESTs 


2 


RC Z41697 


ESTs 


2 


RC Z99349 


ESTs 


2 


RC Z99394 s 


ESTs: Weaklv similar to transformation-related protein fH.saoiensI 
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TABLE 3 





Exemplar 
Accession 


Complete Title 


UniGenelD(1 1/29/99) 




D86425 


Homo sapiens mRNA for nidogen-2 


Hs .82733 


5 


D86983 


Human mRNA for KIAA0230 gene; partial cds 


Hs.11 8893 




HG1098-HT1098 


Cystatin D 






HG1103-HT1103 


"Guanine Nucleotide-Bindinq Protein Ral, Ras-Oncoqene 






HG3342-HT3519 


Id1 






J03764 


plasminogen activator intiibitor; type 1 


Hs.82085 


10 


L06797 


ctiemol<ine (C-X-C motif); receptor 4 (fusin) 


Hs.89414 




L15388 


"Human G protein-coupled receptor kinase (GRK5) mRNA, 


Hs.211569 




L20971 


phosphodiesterase 4B- cAMP specific (dunce 
(Drosophila)-homoloq phosphodiesterase E4) 


Hs.188 




L35545 


endothelial cell protein C/activated protein C receptor 


Hs.82353 




L76380 


calcitonin receptor-like 


Hs.152175 


15 


M21305 


Human aloha satellite and satellite 3 junction DNA sequence 


Hs.247946 




M24736 


selectin E (endothelial adhesion molecule 1) 


Hs.89546 




M31166 


pentaxin-related gene; rapidly induced by IL-1 beta 


Hs.2050 




M31551 


piasminoqen activator inhibitor; type II (arginine-serpin) 


Hs.75716 




M32334 


intercellular adhesion molecule 2 


Hs.83733 


20 


M61916 


laminin; beta 1 


Hs.82124 




M68874 


"Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, 






M74719 


transcription factor 4 


Hs.75356 




M92934 


connective tissue growth factor 


Hs.75511 




M94856 


fatty acid bindinq protein 5 (psoriasis-associated) 


Hs.153179 


25 


U03057 


singed (Drosophila)-like (sea urchin fascin homoloq like) 


Hs.1 18400 




U03877 


EGF-containina fibulin-like extracellular matrix protein 1 


Hs.76224 




U 18300 


damage-specific DNA binding protein 2 (48kD) 


Hs.77602 




U27109 


Human prepromultimerin mRNA; complete cds 


Hs.32934 




U31384 


guanine nucleotide bindinq protein 1 1 


Hs.83381 


30 


U33053 


protein kinase C-like 1 


Hs.2499 




U59423 


MAD (mothers aqainst decapentaplegic; Drosophila) homoloq 1 


Hs.79067 




U70322 


karyopherin (importin) beta 2 


Hs.168075 




U81607 


kinase scaffold protein gravin 


Hs.788 




U83463 


syndecan bindinq protein (syntenin) 


Hs.8180 


35 


U89942 


lysyl oxidase-like 2 


Hs.83354 




X04729 


Human mRNA for plasminogen activator inhibitor type 1 






X06256 


integrin; alpha 5 (fibronectin receptor; alpha polypeptide) 


Hs.149609 




X07820 


matrix metalloprotelnase 10 (stromelysin 2) 


Hs.2258 




X54925 


matrix metalloprotelnase 1 (interstitial collagenase) 


Hs.83169 


40 


X54936 


placental growth factor; vascular endothelial growth factor-related 


Hs.2894 




X60957 


tyrosine kinase with immunoglobulin and epidermal growth factor 


Hs.78824 




X67235 


hematopoietically expressed homeobox 


Hs.1 18651 
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25 



Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 


X67951 


proliferation-associated gene A (natural killer-enhancing factor A) 


Hs 180909 


X69910 


H. sapiens p63 mRNA for transmembrane protein 


Ms 74368 


X79981 


cadherin 5; VE-cadherin (vascular epithelium) 


Hs 76206 


218951 


caveolin 1; caveolae protein; 22kD 


Hs 247266 


AA187101 


"zp61b6.r1 Stratagene endothelial cell 937223 Homo sapiens 
cDNA clone IMAGE:624659 5', mRNA sequence" 




N24990 


ESTs 


Hs.26418 


R81003 


Homo sapiens serine protease mRNA; complete cds 


Hs.1 54737 


AA025351 


ESTs 


Hs. 134797 


AA027168 


ESTs 


Hs.10031 


AA040465 


ESTs 


Hs.8728 


AA0451 36 


ESTs 


Hs.22575 


AA054087 


phospholipase A2; proup IVC (cytosollc; calcium-independent) 


Hs. 18858 


AA071089 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SC WARNING 


Hs.1 87932 


AA085918 


H.sapiens HUNKI mRNA 


Hs.247482 


AA1 87490 


ESTs 


Hs.21941 


AA227926 


ESTs 


Hs.6682 


AA234743 


ESTs 


Hs.22120 


AA236559 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP 


Hs.8768 


AA292694 


ESTs 


Hs.3807 


AA398243 


ESTs; Moderately similar to (defllne not available 3694664) 


Hs.21806 


AA406363 


ESTs 


Hs.30822 


AA4 11465 


ESTs 


Hs.8619 


AA4 12284 


poliovirus receptor 


Hs.171844 


AA423987 


ESTs 


Hs.7567 


AA425309 


ESTs 


Hs.33287 


AA435896 


ESTs 


Hs.1 8397 


AA448238 


Homo sapiens mRNA for KIAA0915 protein; complete cds 


Hs.16714 


AA478778 


ESTs 


Hs.1 6450 


AA621714 


ESTs 


Hs.25338 


D51069 


Human isolate JuSo MUC1 8 glycoprotein mRNA (3' variant); 


Hs.211579 


T34527 


UDP-N-acetyl-alpha-D-galactosamine:polypeptide 
N-acetylpalactosaminyltransferase 1 (GalNAc-T1) 


Hs.80120 


U97519 


podocalyxin-like 


Hs.1 6426 


AA127221 


JSTs 


Hs. 71059 


AA1 32983 


ESTs; Moderately similar to C-1-TETRAHYDROFOLATE 
SYNTHASE; CYTOPLASMIC fH.sapiens] 


Hs.44155 


AA1 35606 


ESTs; Weal<ly similar to !!!! ALU SUBFAMILY SB WARNING 


Hs.1 89384 


AA156125 


ESTs 


Hs.72116 


AA1 79845 


RAB6 Interacting; ldnesin-lil<e (rabkinesin6) 


Hs.73625 


AA232645 


ESTs 


Hs.42699 


F10399 


ESTs 


Hs.1 4763 
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Complete Title 


UniGenelD(11/29/99) 


H 16772 


ESTs 


Ms. 3 1444 


N39584 


ESTs 


Ms. 17404 


N52006 


UDP-N-acetyl-alpiia-D-galactosamine:polypeptide 
N-acetylqalactosaminyltransferase 1 {GalNAc-T1) 


Hs.80120 


N53375 


Homer; neuronal immediate early gene; 3 


Hs.1 66146 


N54067 


Homo sapiens mRNA for NIK; partial cds 


Hs.3628 


N64436 


ESTs 


Hs.20813 


R26892 


ESTs 


Hs.221434 


T33637 


ESTs 


Hs.6841 


T57112 


"yc20g11.s1 Stratagene lung (#937210) Homo sapiens cDNA 
clone IMAGE:81284 3', mRNA sequence." 




W80763 


ESTs; Moderately similar to FK506-bindinq protein 65I<D 


Hs.3849 


AA046808 


ESTs; Hiqhiy similar to 40S RIBOSOMAL PROTEIN S27 


Ms. 108957 


AA253217 


ESTs 


Hs.41271 


AA255991 


ESTs 


Hs.175319 


AA258138 


ESTs 


Hs.88297 


AA426573 


ESTs 


Hs.41135 


AA443793 


ESTs 


Hs.94761 


AA490588 


ESTs 


Hs.43118 


AA496257 


ESTs; Weakly similar to (defline not available 3513303) 


Hs.72165 


AA609717 


ESTs; Weakly similar to MICROTUBULE-ASSOCIATED 


Hs.66048 


D59570 


ESTs 


Hs. 17132 


F13787 


ESTs 


Hs.58596 


H88157 


ESTs 


Hs.41105 


H98g88 


ESTs 


Hs.42612 


N34287 


unc5 (C.eleqans homolog) C 


Hs.44553 


N52090 


EST 


Hs.47420 


N66845 


ESTs; Weakly similar to ALU CLASS B WARNING ENTRY !!!! 


Hs. 165411 


N68905 


small inducible cytokine A5 (RANTES) 




R32894 


ESTs 


Hs,45514 


R61715 


ESTs 


Hs.1 38237 


R71234 


"yi54c08.s1 SosrGs pIscGDts Nb2HP Homo sspisns cDNA clon6 
IMAGE:143054 3' similar to gb|M87908|HUMALNE32 Human 
carcinoma cell-derived Alu RNA transcript, (rRNA); gb:S41458 
ROD CGMP-SPECIFIC 3",5'-CYCLIC PHOSPHODIESTERASE 




R98105 


"yrSOgll.sl Scares fetal liver spleen 1NFLS Homo sapiens cDNA 
clone IMAGE;206852 3', mRNA sequence." 




T97186 


small inducible cytokine A5 (RANTES) 




W80814 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SB WARNING 


Ms. 193700 


AA404418 


EST 


Hs.144953 


AA405747 


ESTs; Moderately similar to HMG-box transcription factor 


Hs.97865 
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AA488687 


ESTs; Moderately similar to Ml! ALU SUBFAMILY SQ WARNING 


Hs.190307 




AA599143 


ESTs; Moderately similar to lill ALU SUBFAMILY SQ WARNING 






AA608588 


ESTs 


Hs. 193634 




AA608751 


ESTs' Moderately similar to ALU SUBFAMILY SC WARNING 


Hs.244904 


5 


C13961 


EST 


Hs.210115 




D60302 


ESTs 


Hs. 108977 




H94892 


v-ral simian leukemia viral oncopene homolog A (ras related) 


Hs.6906 




N93521 


transcription factor 4 


Hs.241362 




N95477 


ESTs 


Hs. 102943 


10 


R60044 


ESTs; Weakly similar to II!! ALU SUBFAMILY J WARNING 


Hs. 106706 




R70506 


ESTs; Moderately similar to transfonnation-related protein 


Hs. 1071 59 




T91518 


"ye20f05.s1 Stratagene lung (#937210) Homo sapiens cDNA 
clone IMAGE:1 1 8305 3' similar to contains Alu repetitive 
element;contains MER12 repetitive element ;, mRNA sequence." 






T95333 


ESTs; Weakly similar to Strabismus [D.melanogaster] 


Hs. 122730 




R45630 


ESTs; Highly similar to KIAA0372 [H .sapiens] 


Hs. 170098 


15 




"yg05c07.r1 Soares infant brain 1NIB Homo sapiens cDNA clone 
iMMoc.oi*M-4 0 , mKiNM sequence. 








ESTs; Moderately similar to envelope protein fH .sapiens! 


Hs. 23986 




AI024874 


ESTs: Weakly similar to (defline not available 3882257) 


Hs. 57958 




W26247 


U5 snRNP-specific protein (220 kD); ortholog of S. cerevislae 


Hs.6413 




AA856990 


ESTs 


Hs. 125058 


20 


AA1 36653 


ESTs 






AA 358869 


ESTs; Highly similar to SEC13-RELATED PROTEIN [H.sapiens] 


(-1 3 227949 




AI123976 




Hs 105689 




Al 369384 


a IsulfaTaseD 






AA379500 


estT ^ 


ns.nyoloo 


25 


^49693 


"ests 


Hs 1 07708 




AA1 95678 


Homo SB lens mRNA for KIAA0465 rotein- arlial cds 


Hs 1 08258 




\^ 30257 


vascular^ceTadhesion molecule 1 — ^ 


Hs. 109225 




AA028131 


ESTs 


Hs 1 10342 






"Human von Wlllebrand factor mRNA 3' end" 


Hs 1 10802 


30 


J03040 


secreted protein- acidic cvsteine-rlch'(osteonectin) 


Hs 1 1 1779 




M86933 


ameloqenin (Y chromosome) 


Hs.1238 




AA012933 


tubulin-specific chaperone d 


Hs.241687 




AA286710 


lymphocyte adaptor protein 


Hs.13131 




AA243278 


ribosomal protein; mitochondrial; L12 


Hs.1 09059 


35 


D59711 


ESTs 


Hs.237289 




T94452 


"ye36g7.s1 Stratagene lung (#93721 ) Homo sapiens cDNA clone 
IMAGE:119868 3', mRNA sequence" 


Hs.241207 
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UniGenelD(1 1/29/99) 




AA053400 


ESTs ^ 


Hs.241227 




AA370302 


Homo sapiens mRNA: cDNA DKFZp586l1518 (from clone 


Hs.21739 




J05008 


endothelin 1 


Hs.2271 




U85193 


nuclear factor l/B 


Hs.33287 


5 


AA256153 


ESTs 


Hs.23912 




X83107 


BMX non-receptor tyrosine l<inase 


Hs.27372 




AA046593 


ESTs 


Hs.2895g 




M410480 


ESTs 


Hs.30089 




D45304 


ESTs 


Hs.31595 


10 


M90657 


transmembrane 4 superfamily member 1 


Hs.3337 




AA010163 


upstream requlatory element bindinq protein 1 


Hs.3383 




AA1 36353 


ESTs 


Hs.38022 






pirin 


Hs.38842 




U84573 


procollaqen-lysine; 2-oxoqlutarate 5-dioxyqenase (lysine 


Hs.41270 


15 


X60486 


H4 histone family; member G 


H 8.46423 




AA1 32969 


metalloprotease 1 (pitrilysin family) 


Hs.4812 




AA1 14250 


KIAA0512 gene product 


Hs.48924 




F 13782 


LIM binding domain 2 


Hs.4980 




AA283035 


ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING 


Hs.54813 


20 


AB002301 


Human mRNA for KIAA0303 gene; partial cds 


Hs. 54985 




AA056731 


Sjogren syndrome antigen A2 (60kD; ribonucleoprotein 


Hs.554 




U68019 


MAD (mothers against decapentaplegic; Drosoptiila) homolog 3 


Hs.211578 




H99198 


ESTs; Moderately similar to THYMOSIN BETA-4 [H.sapiens] 


Hs.56145 




AA598702 


bone morphoqenetic protein 6 


Hs.6101 


25 


N77151 


Homo sapiens mRNA for KIAA0799 protein; partial cds 


Hs.61538 




AA505133 


ESTs 


Hs.62273 




AB000584 


prostate differentiation factor 


Hs.1 16577 




D12763 


interleukin 1 receptor-like 1 


Hs.66 




AA253193 


ESTs 


Hs.6631 


30 


AA432248 


ESTs 


Hs.6738 




AA083572 


v-ral simian leukemia viral oncogene homoloq A (ras related) 


Hs.6906 




AA479713 


ESTs 


Hs.71962 




L40395 


Homo sapiens clone 23689 mRNA; complete cds 


Hs.170001 




X52947 


qap junction protein; alpha 1; 43kD (connexin 43) 


Hs. 74471 


35 


W80846 


vesicle-associated membrane protein 5 (myobrevin) 


Hs.74669 




M34539 


FK506-bindinq protein 1A (12kD) 


Hs.752 




D67029 


SEC14 (S. cerevisiae)-like 


Hs.75232 




U09587 


glycyl-tRNA synthetase 


Hs.75280 




M85289 


"Human heparan sulfate proteoglycan (HSPG2) mRNA, complete 


Hs.211573 


40 


D 10522 


myristoylated alanine-rich protein kinase C substrate (MARCKS; 


Hs.75607 




W84712 


calumenin 


Hs.7753 




029992 


tissue factor pathway inhibitor 2 


Hs.78045 
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Complete Title 


UniGenelD(1 1/29/99) 




L34657 


platelet/endothelial cell adhesion molecule (CD31 antigen) 


Hs.78146 




S78569 


laminin; alpha 4 


Hs.78672 




D43636 


Human mRNA for KIAA0096 gene; partial cds 


Hs.79025 




U97188 


IGF-II mRNA-bindlnq protein 3 


Hs.79440 


5 


AA487558 


ESTs 


Hs.8135 




M28882 


"Human MUC18 glycoprotein mRNA, complete cds" 


Hs.211579 




X70683 


SRY (sex determining region Y)-box 4 


Hs.83484 




X14787 


thrombospondin 1 


Hs.87409 




AA236324 


ESTs; Weakly similar to !!!! ALU CLASS A WARNING ENTRY !!ll 


Hs.92381 


10 


C 15324 


ESTs 


Hs.93668 




AA452000 


ESTs 


Hs.94030 






collagen-binding protein 2 (colligen 2) 


Hs.9930 




D00596 


Homo sapiens gene for thymidylate synthase; exons 1 ; 2; 3; 4; 5; 


Hs. 196351 




D11428 


peripheral myelin protein 22 


Hs. 103724 


15 


D13640 


major histocompatibility complex; class 1; C 


Hs. 183618 




D14874 


adrenomedullin 


Hs,394 




D26129 


ribonuclease; RNase A family; 1 (pancreatic) 


Hs.78224 




D28476 


thyroid hormone receptor interactor 12 


Hs. 138617 




D86425 


Homo sapiens mRNA for nidogen-2 


Hs.82733 


20 


D86983 


Human mRNA for KIAA0230 gene; partial cds 


Hs. 118893 




D87953 


N-myc downstream regulated 


Hs .75789 




HG1862-HT1897 


Calmodulin Type 1 






HG2614-HT2710 


"Collagen, Type Viii, Alpha 1" 






HG2639-HT2735 


Single-Stranded Dna-Binding Protein Mssp-1 




25 


HG2855-HT2995 


"Heat Shock Protein, 70 Kda (Gb:Y00371)" 






HG3044-HT3742 


"Fibronectin, Alt. Splice 1" 






HG3342-HT3519 


Idl 






HG3543-HT3739 


Insulin-Like Growth Factor 2 






HG4069-HT4339 


Monocyte Chemotactic Protein 1 




30 


HG417-HT417 


Cathepsin B 






J03764 


plasminogen activator inhibitor; type 1 


Hs.82085 




L06797 


chemokine (C-X-C motif); receptor 4 (fusin) 


Hs.89414 




L08246 


myeloid cell leukemia sequence 1 (BCL2-related) 


Hs.86386 




L12711 


transketolase (Wernicke-Korsakoff syndrome) 


Hs.89643 


35 


LI 3977 


prolylcarboxypeptidase (angiolensinase C) 


Hs.75693 




L15388 


"Human G protein-coupled receptor kinase (GRK5) mRNA, 






L19871 


activating transcription factor 3 


Hs.460 






Human leukemia virus receptor 1 (GLVR1) mRNA: complete cds 


Hs. 78452 




L42176 


four and a half LIM domains 2 


Hs.8302 


40 


L49169 


Human G0S3 mRNA: complete cds 


Hs.75678 




L76380 


calcitonin receptor-like 


Hs.152175 




M15990 


v-yes-1 Yamaguchi sarcoma viral oncogene homolog 1 


Hs. 194148 




M23254 


calpain; large polypeptide L2 


Hs. 76288 
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M24736 


selectin E (endothelial adhesion molecule 1) 


Hs.89546 




M26576 


collagen; type IV; alpha 1 


Hs.119129 




M27396 


asparaglne synthetase 


Hs.75692 




M31166 


pentaxin-related gene; rapidly induced by IL-1 beta 


Hs.2050 




M31994 


"Homo sapiens aldehyde dehydrogenase (ALDH1) Qene, exon 13 






M32334 


intercellular adhesion molecule 2 


Hs. 83733 




M 35878 


insulin-like growth factor binding protein 3 


Hs.77326 




M36429 


postmeiotic segregation increased 2-like 12 


Hs,89672 




M 57730 


ephrln-A1 


Hs.1624 


10 


M57731 


GR02 oncogene 


Hs. 75765 




M60858 


nucleolin 


Hs.79110 




M62994 


fllamin B; beta (actin-binding protein-278) 


Hs.81008 




M68874 


"Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, 






M69043 


nuclear factor of ka aliht ol e tide ene enhancer in B cells 


Hs 81328 


15 


M74719 


transcri tion factor 4^^ 9 P yP P S 






M75126 


hexokinase"l "'^ 


ns. 1 1 oo^o 




M84349 


CD59 antigen pi 8-20 (antigen identified by monoclonal antibodies 
16.3A5; EJ16; EJ30; EL32 and G344) 


Hs. 11 9663 




M92843 


zinc finger protein homologous to Zfp-36 in mouse 


Hs. 198309 




M92934 


connective tissue growth factor 


Hs.75511 


20 


M93056 


protease inhibitor 2 (anti-elastase); monocyte/neutrophil 


Hs. 183583 




M94856 


fatty acid binding protein 5 (psoriasis-associated) 


Hs. 1531 79 




M95787 


transgelin 


Hs.75777 




S76965 


Protein kinase inhibitor [human; neuroblastoma cell line 


Hs. 75209 




S81914 


DIFFERENTIATION-DEPENDENT GENE 2 


Hs. 76095 


25 


U03057 


singed (Drosophila)-like (sea urchin fascin homolog like) 


Hs. 118400 




U03100 


catenin (cadherin-associated protein); alpha 1 (102kD) 


Hs. 1784 52 




U03877 


EGF-containing fibulin-like extracellular matrix protein 1 


Hs.76224 




U08021 


nicotinamide N-methvltransferase 


Hs. 76669 




U14391 


myosin IC 


Hs.82251 


30 


U31384 


guanine nucleotide binding protein 11 


Hs.83381 




U32944 


dynein; cytoplasmic; light polypeptide 


Hs.5120 




□40369 


"Human spermidine/spermine N 1 -acetyltransferase (SSAT) gene, 






U41767 


"Human metargidin precursor mRNA, complete cds" 






U48959 


Homo sapiens myosin light chain kinase (MLCK) mRNA; 


Hs.75950 


35 


U51010 


"Human nicotinamide N methyllransferase gene exon 1 and 5' 






U51478 


ATPase; Na+/K+ transporting; beta 3 polypeptide 


Hs.76941 




U53445 


Human ovarian cancer downregulated myosin heavy chain 
homolog (Doc1 ) mRNA; complete cds 


Hs. 15432 




U 59289 


cadherin 13; H-cadherin (heart) 


Hs.63984 




U59423 


MAD (mothers against decapentaplegic; Drosophila) homolog 1 


Hs.79067 


40 


U62015 


"Homo sapiens CyrSI mRNA, complete cds" 
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U63825 


Human he atitis delta anti en interactin rotein A (di A) mRNA- 
E 2 a£ L.2J :_ 


Hs 667 1 3 




U67963 


Human lysophospholipase homolog (HU-K5) mRNA; complete 


Hs.6721 




y73379 


Human cyclin-selective ubipuitin cannier protein mRNA; complete 


Hs. 93002 




— 


eukaryotic translation initiation factor 4 gamma; 2 


Hs. 183684 






microsomal Qlutathione S-transferase 2 


Hs.81874 






U81607 


kinase scaffold protein gravin 






U89942 


lysyl oxidase-like 2 


Hs.83354 




X04412 


gelsolin (amyloidosis; Finnish type) 


Hs.80562 




X06985 


heme oxygenase (decyclinq) 1 


Hs, 75967 


10 


X07820 


matrix metaiioproteinase 10 (stromelysin 2) 


Hs.2258 




X12876 


keratin 18 


Hs.65114 






DEAD/H (Asp-Glu-Ala-Asp/Hls) box polypeptide 5 (RNA helicase; 


Hs. 76053 




^2M? 


early growth response 1 






X53416 


filamin A; alpha (actin-binding protein-280) 


Hs,76279 


15 


X54489 


GR01 oncogene (melanoma growth stimulating activity; alpha) 


Hs.789 




X54925 


matrix metaiioproteinase 1 (interstitial collagenase) 


Hs.83169 




X57206 


inositol 1 ;4;5-trisphosphate 3-klnase B 


Hs.78877 




X59798 


cyclin DI (PRAD1; parathyroid adenomatosis 1) 


Hs.82932 




X60957 


tyrosine kinase with immunoglobulin and epidermal growth factor 


Hs.78824 


20 


X65965 


H. sapiens SOD-2 gene for manganese superoxide dismutase 






X69111 




. . . 

inhibitor of DNA binding 3; dominant nepa^ive helix-loop-helix 


Hs. 76884 




X70940 


eukaryotic translation elonc^ation factor 1 alpha 2 


HS:2642 




X87838 


catenin (cadherin-associated protein)," b6td 1 (88kD) 


Hs. 171271 






thioredoxin reductase 1 


Hs. 13046 


25 


X97^8 


H. sapiens PTX3 c^ene promotor region 






Y00815 


protein tyrosine phosphstsse,* receptor typei F 


Hs.7521 6 




AA30371 1 




Hs. 144700 






eph^r»i-B1 


Hs. 1 56044 




AA025351 


ESTs 


Hs. 134797 


30 


AA027050 


ESTs 


Hs.31189 




AA029462 


ESTs 


Hs. 17235 




AA045136 


ESTs 


Hs.22575 




AA047437 


ESTs 


Hs.22968 




AA054087 


phospholipase A2; group IVC (cytosolic; calcium-independent) 


Hs.18858 


35 


AA071089 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SC WARNING 


Hs.187932 




AA1 56450 


ESTs; Weakly similar to Similar to Rat trg gene product 


Hs.8982 




AA1 87490 


ESTs 


Hs.21941 




AA1 95031 


ESTs; Moderately similar to PROBABLE G PROTEIN-COUPLED 
RECEPTOR APJ [H.sapiens] 


Hs.9305 




AA205724 


ESTs 


Hs.10119 


40 


AA227926 


ESTs 


Hs.6682 



127 



wo 01/11086 



PCT/USOO/22061 





Exemplar 
Accession 


Complete Title 
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AA227986 


ESTs 


Hs.25329 




AA234743 


ESTs 


Hs.22120 




AA253216 


ESTs 


Hs.22283 




AA256210 


oncomodulin 


Hs. 199134 


5 


AA256268 


ESTs 


Hs. 10283 




AA279397 


ESTs; Moderately similar to flbronectin [H.sapiens] 


Hs.25001 




AA292379 


ESTs; Moderately similar to ALU SUBFAMILY SQ WARNING 


Hs.20340 




AA292717 


ESTs; Weakly similar to JM2 [H.sapiensl 


Hs.7891 




AA346551 


ESTs 


Hs.23457 


10 


AA400292 


ESTs 


Hs.23786 




AA404338 


ESTs 


Hs,21812 




AA4 12284 


poliovirus receptor 


Hs. 171 844 




AA423987 


ESTs 


Hs.7567 




AA428594 


ESTs 


Hs.21321 


15 


AA430108 


ESTs 


Hs.6019 




AA431462 


ESTs 


Hs.28329 




AA431470 


ESTs; Weakly similar to CAMP-DEPENDENT PROTEIN KINASE 
INHIBITOR; MUSCLE/BRAIN FORM FH.sapiens] 


Hs.3407 




AA443756 


ESTs; Moderately similar to (deflfne not available 4105275) 


Hs 6673 




AA449479 


ESTs* Highly similar to (defline not available 5106787) 




20 


AA459916 


bradykinin receptor B2 


Hs.25021 




AA465226 


ESTs 


Hs.28631 




AA478778 


ESTs 


Hs. 16450 




AA479037 


ESTs 


Hs.7961 




AA482597 


ESTs; Highly similar to (defline not available 4704739) 


Hs.26054 


25 


AA487561 


ESTs; Highly similar to RAS-RELATED PROTEIN RAB-1A 


Hs.9813 




AA489245 


ESTs; Weakly similar to spenn specific protein [H.sapiens] 


Hs.5682 




AA504110 


ESTs 


Hs.18063 




AA520989 


ESTs; Highly similar to SERINE/THREONINE PROTEIN 
PHOSPHATASE PP1-BETA CATALYTIC SUBUNIT FH sapiens] 


Hs.9195 




AA599434 




Hs. 25035 


30 


AA608649 


Homo sapiens clone 23742 mRNA; partial cds 


Hs.6354 




AA609519 


ESTs 


Hs.26458 




351069 


Human isolate JuSo MUC18 glycoprotein mRNA (3' variant); 


Hs. 18571 8 




J97519 


podocalyxin-like 


Hs. 16426 




/V28391 


proliferation-associated 2G4; 38kD 


Hs.5181 


35 


AA035638 


Homo sapiens mRNA; cDNA DKFZp564F053 (from clone 


Hs.71968 




AA083514 


ESTs 


Hs.68301 




AA121315 


ESTs 


Hs.70823 




AA147186 


ESTs 


Hs,92387 




AA156125 


ESTs 


Hs.72116 


40 


AA1 88932 


ESTs 


Hs.85640 
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AA2 19653 





Hs.87125 




5—- 


Hs.42699 






Hs. 13233 


H48032 


ESTs 




H82117 


ESTs 


Hs.28043 


N3g584 


ESTs 


Hs. 17404 


N54067 


Homo sapiens mRNA for NIK; partial cds 


Hs.3628 


N59858 


ESTs 


Hs.33032 


N90933 


ESTs 


Hs.4867 


N93764 


ESTs; Moderately similar to Mil ALU CLASS C WARNING ENTRY 


Hs.10175 


R26124 


ESTs 


Hs.24024 


R27957 


ESTs 


Hs.24230 


R55470 


ESTs; Moderately similar to K02E10.2 [C.elegans] 


Hs.11067 


T16550 


ESTs; Highly similar to vacuolar protein sorting homolog h-vps45 


Hs.6650 


T26674 


ESTs- Weakly similar to neuronal thread protein AD7c NTP 


Hs 6966 


T57112 


••yc20g1 1 .s1 Stratagene lung (#937210) Homo sapiens cDNA 

clone IMAGE:81284 3", mRNA sequence." 


Hs.8881 


1 OOf\J\J 




Hs. 173374 


T90527 


ESTs 


Hs.7890 


W42789 


ESTs 


Hs.31446 


W60002 


plastin 3 (T isoform) 


Hs.4114 


W78175 


ESTs 


Hs. 17901 


W84768 


ESTs 


Hs. 141 742 


W94427 


ESTs; Weakly similar to Na;K-ATPase Qamma subunit 


HS;3807 


AA2532 1 7 




"^•'^^^^^ 


AA426573 


"^STs ~~ 


Hs.41135 


AA432374 


— 

S 


Hs. 48029 




1§I2 . 


Hs. 74313 


AA478771 




Hs. 50841 


AA482594 


ESTs ~" 


Hs. 62684 


AA490588 


ESTs 


Hs.43118 


D59570 


ESTs 


Hs.17132 


H88157 


ESTs 


Hs.41105 


H94648 


ESTs 


Hs.41995 


H97538 


ESTs 


Hs.42392 


H98670 


ESTs; Weakly similar to (defline not available 4884081) 


Hs.49753 


N22107 


ESTs; IVIoderately similar to !!!! ALU SUBFAMILY SC WARNING 


Hs.1 72241 


W38197 


Accession not listed in Genbank 




W80814 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SB WARNING 


Hs. 196785 


AA287347 


ESTs 


Hs.1 05088 


AA402799 | 


ESTs 


Hs.1 82538 
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Complete Title 
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AA404418 


EST 


Hs. 144953 


AA425107 





Hs.97016 


AA425435 


ESTs; Moderately similar to I!!! ALU SUBFAMILY J WARNING 


Hs.98438 


AA442872 


ESTs 


Hs.110771 


AA452860 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SP WARNING 


Hs.197214 


AA488687 


ESTs; Moderately similar to III! ALU SUBFAMILY SQ WARNING 


Hs. 190307 


AA599674 


ESTs; Weakly similar to ORF [D.melanoqaster] 


Hs. 1081 15 


F13673 


ESTs 


Hs.99769 


H99093 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide (72kD) 


Hs.6179 


N22495 


"yw35g11.s1 Morton Fetal Cochlea Homo sapiens cDNA clone 
IMAGE:254276 3', mRNA sequence." 


Hs.102415 


N23031 


myosin; heavy polypeptide 7; cardiac muscle; beta 


Hs.929 


R15740 


carbohydrate (chondroitin 6/keratan) sulfotransferase 1 


Hs. 104576 


R39610 


calpain; larqe polypeptide L2 


Hs. 76288 


W45560 


ESTs 


Hs 1 02541 


Z39833 


H.sapiens mRNA for Rho6 protein 


Hs. 124940 


Z40583 


ESTs 


Hs. 101259 


AA825437 


ESTs 




R66613 


Homo sapiens mRNA: cDNA DKFZp564F053 (from clone 




AA868063 


carbohydrate (chondroitin 6/keratan) sulfotransferase 1 




AA1 28075 


"zl16d08.r1 Saares_pregnant_uterus NbHPU Homo sapiens 
cDNA clone IMAGE :502095 5", mRNA sequence." 




N66570 






AI051390 


ESTs 




AA627122 


ESTs 




X02761 


fibronectin 1 


Hs.1 18162 


AF010193 


MAD (mothers against decapentapleqic; Drosophila) homoloq 7 


Hs. 100602 


AA1 49044 


ESTs; Highly similar to the KIAA01 95 gene is expressed 


Hs.1 0086 


U82108 


solute carrier family 9 (sodium/hydrogen exchanger); isoform 3 


Hs.101813 


D78676 


ESTs; Moderately similar to (defline not available 4529890) 


Hs.1 05509 


L35240 


enigma (LIM domain protein) 


Hs.1 02948 


AA598737 


lactate dehydrogenase B 


Hs.180414 


R69417 


ESTs 


Hs 107055 


AA232837 


ESTs; Weakly similar to Human pre-mRNA cleavage factor 1 68 
kDa subunit [H.sapiens] 


Hs.1 07125 


N72695 


ESTs 


Hs.1 08557 


M30257 


vascular cell adhesion molecule 1 


Hs.1 09225 


M96843 


nhibitor of DNA binding 2; dominant negative helix-loop-helix 
protein 


Hs.109617 


X68277 


dual specificity phosphatase 1 


Hs.171695 


AA292440 


myeloid differentiation primary response 


Hs.1 10571 


J03040 


secreted protein; acidic; cysteine-rich (osteonectin) | 


Hs.1 11779 



130 



wo 01/11086 



PCTAJSOO/22061 





Exemplar 
Accession 


Complete Title 


UniGenelD(1 1/29/99) 




AA228107 


ESTs 


Hs. 54642 




AA449789 


connective tissue growtli factor 


Hs.75511 




W01367 


ESTs 


Hs. 170980 




AA610116 


ESTs; Highly similar to (defllne not available 4325180) 


Hs.11663 


5 


AA258308 


Homo sa iens mRNA' cDNA DKFZ 564F053 from cl n 


Hs.1 65618 




AA460273 


Homo IT^ens mRNA fo°KIAAoTl7'^T^ ^^"^""^ ^'""^ 

omo sapiens m or^ 517 protein, partial cds 


Hs. 12372 




AA2 86710 




Hs.1 3131 




T68873 


metLl^hionein^l l"'^ '^"^ 


Hs.1 43289 




D63476 


PAK interact^nq exchange factor beta 


Hs.1 72813 


10 


M62403 


insulin-like growth factor-binding protein 4 


Hs.1516 




X55740 


5' nucleotidase (CD73) 


Hs.1 53952 




L 10284 


calnexin 


Hs.1 55560 




AA243278 


ribosomal protein; mitochondrial; LI 2 


Hs.1 09059 




AA430032 


pituitary tumor-transforminq 1 


Hs. 159626 


15 


HI 6402 


ESTs 


Hs 17121 




D5971 1 


ESTs 


Hs.1 7132 




T94452 


"ye36g7.s1 Stratagene lung (#93721) Homo sapiens cDNA clone 
IMAGE: 11 9868 3' mRNA seguence" 






AA431571 


ESTs 


Hs. 1 7894 




R79356 


Homo sapiens mRNA for KIAA0544 protein, partial cds 


Hs.1 9280 


20 


AA280375 




Hs.1 9928 




Z49269 


small inducible cytokine subfamily A (Cys-Cys); member 14 


Hs.20144 




Z41740 


ESTs 


Hs.24462 




AA121543 


Homo sapiens mRNA for KIAA0758 protein; partial cds 


Hs.22039 




J05008 


endothelin 1 


Hs.2271 


25 


AA101878 


ESTs 


Hs.22793 




T35341 


ESTs; Highly similar to (defline not available 4519883) 


Hs.22880 




yj87590 


ESTs*^'^"^^ 


Hs. 23037 




AA256153 


ESTs ~~ 


^^^^^^^ 




W74533 


Homo sa iens mRNA for KIAA0786 rotein ' 

omo sapiens m or protein, partial cds 


^^■^'^^'^^ 


30 


J25997 




Hs. 25590 






V fosFBJ murine osteosarcoma viral onco n 


^^•^^^"^^ 




V01512 


v fos FBJ murine osteosarcoma viral oncogene homolo^ 


Hs.25647 




VOI512" 


V fos FBJ murine osteosarcoma viral oncogene homolo^ 


^^■^^^^^ 




^01512 


V fos FBJ murine osteosarcoma viral onc°^^r 


Hs.25647 


35 


X56681 


un°D protoTncogene"^^'^'^""'^ ^"^^ oncogene omo og 


Hs.2780 




flJM 61292 


nterferon: alpha-inducible protein 27 






flj\491465 


ESTs 


Hs.28792 




^A046593 


ESTs 


Hs.28959 




D50914 


Human mRNA for KIAA0124 gene; partial cds 


Hs.30736 


40 


D45304 


ESTs 


Hs.31595 




VI90657 


transmembrane 4 superfamily member 1 


Hs.3337 
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Complete Title 


UniGenelD(11/29/99) 




W69127 


ESTs; Weakly similar to zinc finger protein ZNF191 [H. sapiens] 


Ms. 3449 




AA316186 


ESTs; Highly similarto (defline not available 4262136) 


Hs.34549 




AA384503 


ESTs 


Hs. 179260 




AA1 36353 


ESTs 


Hs. 38022 


5 


AA044755 


ESTs; Weakly similarto !!!! ALU SUBFAMILY SX WARNING 


Hs. 173705 




U84573 


procollagen-lysine; 2-oxoglutarate 5-dioxygenase (lysine 
hydroxylase) 2 


Hs.41270 




AA05891 1 


ESTs; Weakly similarto membrane glycoprotein fM.musculus] 


Hs.4193 




AA620962 


dynein; cytoplasmic; light intermediate polypeptide 2 


Hs.44251 




AA285290 


small EDRK-rich factor 2 


Hs.44499 


10 


X60486 


H4 histone family; member G 


Hs.46423 




R31641 


ESTs 


Hs. 197 148 




AA489190 


ESTs 


Hs.48320 




F13782 


LIM binding domain 2 


Hs.4980 




AA257993 


Janus kinase 1 (a protein tyrosine kinase) 


Hs.50651 


15 


M24283 


intercellular adhesion molecule 1 (CD54); human rhinovirus 


Hs. 168383 




AA443114 


ESTs; Weakly similarto PIM-1 PROTO-ONCOGENE 
SERINE/THREONINE-PROTEIN KINASE fH.sapiensl 


Hs.5326 




T35289 


casein kinase 1; alpha 1 


Hs.195206 




N23817 


Homo sapiens clone 23675 mRNA sequence 


Hs.5807 




AA0471 51 


ESTs 


Hs.5897 


20 


N77151 


Homo sapiens mRNA for KIAA0799 protein; partial cds 


Hs.61638 




AA480074 


ESTs 


Hs.62206 




Y00787 


interleukin 8 


Hs.624 




T99789 


ESTs 


Hs.64313 




W84341 


tissue inhibitor of metalloproteinase 2 


Hs.6441 


25 


L09209 


amyloid beta (A4) precursor-like protein 2 


Hs.64797 




D12763 


interleukin 1 receptor-like 1 


Hs.66 




T16484 


ESTs 


Hs.6607 




AA253193 


ESTs 


Hs.6631 




AA432248 


ESTs 


Hs.6738 


30 


X82200 


stimulated trans-acting factor (50 kDa) 


Hs.68054 




AA083572 


v-ral simian leukemia viral oncogene homolog A (ras related) 


Hs.6906 




L00352 


low density lipoprotein receptor (familial hypercholesterolemia) 


Hs. 181 182 




N75791 


ESTs 


Hs.7153 




X57579 


H.sapiens activin beta-A subunit (exon 2) 




35 


X02612 


cytochrome P450; subfamily 1 (aromatic compound-inducible); 


Hs.72912 




H44631 


mmediate early protein 


Hs.737 




AA090257 


superoxide dismutase 2; mitochondrial 


Hs.1 77781 




X83703 


H.sapiens mRNA for cytokine inducible nuclear protein 


Hs.74019 




L40395 


Homo sapiens clone 23689 mRNA; complete cds 


Hs.1 70001 


40 


^A227913 1 


ESTs 


Hs.1 98456 
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Accession 


Complete Title 


UniGeneIDd 1/29/99) 




X52947 


gap junction protein; alpha 1 ; 43kD (connexin 43) 


Hs.74471 




Ml 1313 


alpha-2-macroqlobulin 


Hs.74561 




L14837 


tight junction protein 1 (zona occludens 1 ) 


Hs.74614 




M60721 


"Human homeobox gene, complete cds" 




5 


D90209 


activating transcription factor 4 (tax-responsive enhancer element 


Hs.181243 




T67986 


"yc28e12.s1 Stratagene liver (#937224) Homo sapiens cDNA 
clone IMAGE:82030 3' similar to gb:X14723 CLUSTERIN 


Hs.75106 




AA148318 


Human mRNA for KIAA0069 qene; partial cds 


Hs.75249 




U97105 


dihydropyrimidinase-like 2 


Hs.173381 




T25747 


H.sapiens OZF mRNA 


Hs.75471 


10 


K02574 


Accession not listed in Genbank 






D78577 


tyrosine 3-monooxygenase/tryptophan 5-monooxygenase 
activation protein; eta polypeptide 


Hs.75544 




X53331 


matrix Gla protein 


HS.75742 




S73591 


uprecjulated by 1;25-dihvdroxyvitamin D-3 


Hs. 179526 




X95735 




Hs.75873 


15 


LI 6862 


G protein-coupled receptor kinase 6 


Hs.76297 




U44975 


Homo sapiens Kruppel-like zinc finger protein Zf9 mRNA; 


Hs.76526 




M 97796 


inhibitor of DMA binding 2; dominant negative helix-loop-heiix 


Hs.180919 




U86782 


26S proteasome-associated padi homolog 


Hs. 178761 




AA099391 


ESTs 


Hs.77310 


20 


M 19267 


tropomyosin 1 (alpha) 


Hs.77899 




D29992 


tissue factor pathway inhibitor 2 


Hs.78045 




L19314 


phosphorylase kinase; beta 


Hs. 19521 7 




S78569 


laminin; alpha 4 


Hs.78672 




U28811 


"Human cysteine-rich fibroblast growth factor receptor (CFR-1 ) 




25 


L77886 


protein tyrosine phosphatase; receptor type; K 


Hs.79005 




C14407 


neuronal tissue-enriched acidic protein 


Hs.79516 




M60278 


diphtheria toxin receptor (heparin-binding epidermal growrth 


Hs.799 




R81509 


splicing factor; arginine/serine-rich 11 


Hs. 184571 




AA487558 


ESTs 


Hs.8135 


30 


D86962 


KIAA0207 gene product 


Hs.81875 




AA478971 


disabled (Drosophila) homolog 2 (mitogen-responsive 


Hs. 81988 




D50683 


transforming growth factor; beta receptor II (70-80kD) 


Hs. 82028 




U56637 


capping protein (actin filament) muscle Z-line; alpha 1 


Hs. 184270 




M61199 


Human cleavage signal 1 protein mRNA: complete cds 


Hs.82767 


35 


M28882 


"Human MUC1 8 glycoprotein mRNA. complete cds" 






X15183 


CDW52 antigen (CAMPATH-1 antigen) 


Hs. 180532 




S53911 


CD34 


Hs.85289 
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Accession 


Complete Title 


UniGenelD(1 1/29/99) 




U20734 


Human transcription factor junB (junB) qene; 5' region and 


Hs. 198951 




D28235 


prostaglandin-endoperoxide synthase 2 (prostaglandin G/H 


Hs.92309 




AA236324 


ESTs; Weakly similar to !!!! ALU CLASS A WARNING ENTRY !!!! 


Hs.92381 




AA1 48923 


Homo sapiens mRNA for DEPP (decidual protein induced by 


Hs.93675 


5 


AA174183 


ESTs 


Hs.93872 




AA45631 1 


ESTs' W63klv similsrto "" ALU CLASS A WARNING ENTRY "" 


Hs.93961 




L08069 


heaUhock protein, DNAJ-like 2 


HS:94 




AA452000 




Hs. 94030 




AA282140 


ESTs 


Hs.9587 


10 


J02854 


myosin requlatory light chain 2; smooth muscle Isoform 


Hs.9615 




AA442054 


phosphollpase C; qamma 1 (formerly subtype 148) 


Hs.993 




AB000450 


vaccinia related kinase 2 






AB002380 


KIAA0382 protein 






AB003103 


roteasome rosome- macro ain) 26S subunit- non ATPase- 12 






AB004884 


to'^us?ed'ire^kinase2^ macropain) su uni . non- ase^ 






AF000573 


homogentisate 1^2-dioxygenase (homogentisate oxidase) 






AF008937 








AF009301 


similar to S. cerevisiae SSM4 






AF009368 


cAMP responsive element binding protein 3 (luman) 




20 


D00591 


chromosome condensation 1 






D00760 


proteasome (prosome; macropain) subunit; alpha type; 2 






D11139 


tissue inhibitor of metalloproteinase 1 (erythroid potentiating 






D14657 


KIAA0101 gene product 






D14878 


D123 gene product 




25 


D17716 


mannosyl (alpha-1;6-)-glycoproteln 
beta-1;6-N-acetyl-glucosamlnyltransferase 






D21090 


RAD23 (S. cerevisiae) homolog B 






D26135 


diacylglycerol kinase; gamma (90kD) 






326528 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 7 (RNA helicase' 












30 


^1762 


K^/WjOsTq^enrpro'd^ 






D31765 


KIAA0061 protein 






331888 


KIAA0071 protein 






338128 


prostaglandin 12 (prostacyclin) receptor (IP) 






D38500 


postmeiotic segregation increased 2-like 4 




35 


D38551 


RAD21 (S. pombe) homolog 






D42087 


KIAA01 18 protein 






D49396 


antioxidant protein 1 






D55640 








D53391 


platelet-activating factor acetylhydrolase; isoform lb; qamma 
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Complete Title 


UniGeneIDd 1/29/99) 




D63477 


KIAA0143 protein 






D63483 


acetyl LDL receptor; SREC 






D64015 


TIA1 cytotoxic qranule-associated RNA-bindInq protein-like 1 






D79990 


KIAA0168 cjene product 






D79997 


KIAA0175 gene product 






D80010 


KIAA0188 protein 






D84276 


CD38 antigen (p45) 






D86425 


nidogen 2 






D86978 


KIAA0225 protein 




10 


D87012 


Human (lambda) DNA for immunoqiobin liqht chain 






□87075 


solute can-ier family 23 (nucleobase transporters); member 1 






D87432 


solute carrier family 7 (cationic amino acid transporter; y+ 






D_744 


topolsomerase (DNA) II binding protein 






D87845 


platelet-activating factor acetylhydrolase 2 (40kD) 






HG1 098-HT1 098 








HG21 67-HT2237 








HG241 5-HT251 1 








HG2825-HT2949 








HG2887-HT3031 






20 


HG4660-HT5073 








HG4704-HT5146 








HG884-HT884 








HG919-HT919 








J00212 






25 


J04029 


keratin 10 (epidermolytic hyperkeratosis; keratosis palmarls et 






J04031 


methylenetetrahydrofolate dehydrogenase (NADP+ dependent); 
methenyltetrahydrofolate cyclohydrolase; formyltetrahydrofolate 






J04088 


topolsomerase (DNA) II alpha (170kD) 






J04543 


annexln A7 






L06139 


TEK tyrosine kinase; endothelial (venous malformations; multiple 
cutaneous and mucosal) 




30 


L07540 


replication factor C (activator 1 ) 5 (36.5kD) 






08895 


MADS box transcription enhancer factor 2; polypeF>tlde C 








gastrulatlon brain homeo box 1 






^11353 


neurofibromin 2 (bilateral acoustic neuroma) 








myeloid/lymphoid or mixed-lineage leukemia (trithorax 

'Drosophila) homolog); translocated to; 2 




35 


.13800 








L 14922 


replication factor C (activator 1) 1 (145kD) 






L15189 


heat shock 70kD protein 9B (mortalin-2) 






L15388 


G protein-coupled receptor kinase 5 






LI 6895 


ysyl oxidase 




40 


L27476 


tight junction protein 2 (zona occludens 2) 
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Complete Title 
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L27624 


tissue factor pathway inhibitor 2 






L32976 


mitogen-activated protein kinase kinase kinase 1 1 






L33404 


kallikrein 7 (chymotryptic; stratum comeum) 






L35263 


mitogen-activated protein kinase 14 




5 


L37347 


solute carrier family 1 1 (proton-coupled divalent metal ion 






L40371 


thyroid hormone receptor interactor 4 






L40391 


Homo sapiens (clone s153) mRNA fragment 






L41607 


glucosaminyl (N-acetyl) transferase 2; l-branching enzyme 






L77566 


DiGeorge syndrome critical region gene DGSI 




10 


M 13928 


aminolevulinate; delta-; dehydratase 






Ml 3928 


aminolevulinate; delta-; dehydratase 






M14016 


uroporphyrinogen decarboxylase 






IVI14219 


decorin 






Ml 5796 


proliferating cell nuclear antigen 




15 


M21305 


Human alpha satellite and satellite 3 junction DNA sequence 






M22092 








M22898 


tumor protein p53 (Li-Fraumeni syndrome) 






M22995 


RAP1A; member of RAS oncogene family 






M23379 


RAS p21 protein activator (GTPase activating protein) 1 




20 


M24364 


major histocompatibility complex; class II; DQ beta 1 






M24400 


chymotrypsinogen B1 






M25753 


cyclin B1 






M27691 


cAMP responsive element binding protein 1 






M28213 


RAB2; member RAS oncogene family 




25 


M29550 


protein phosphatase 3 (fonneriy 28); catalytic subunit; alpha 






M29971 


O-6-methylguanine-DNAmethyltransferase 






M30269 


nidogen (enactin) 






M31158 


protein kinase; cAMP-dependent; regulatory; type II; beta 






M31166 


pentaxin-related gene; rapidly induced by IL-1 beta 




3 0 


\^31210 


endothelial differentiation; sphingolipid G-protein-coupled 
receptor; 1 






\^55420 


Epsilon ; IqE 






M59979 


prostaglandin-endoperoxide synthase 1 (prostaglandin GIH 






M62810 


transcription factor 6-like 1 (mitochondrial transcription factor 






M63838 


interferon; gamma-inducible protein 16 




35 


M64710 


Human CNP gene for C-type natriuretic peptide 






\^68874 








M74524 


ubiquitin-conjugating enzyme E2A (RAD6 homolog) 






M80254 


peptidylprolyl isomerase F (cyclophilin F) 






M81780 


sphingomyelin phosphodiesterase 1 ; acid lysosomal (acid 




40 


M81780 


sphingomyelin phosphodiesterase 1 ; acid lysosomal (acid 
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M81780 


sphingomyelin phosphodiesterase 1; acid lysosomal (acid 






M81780 


Homo sapiens acid sphingomyelinase (SMPD1] gene; complete 
cds; ORF*s 1-3; complete cds's 






M81 780 


Homo sapiens acid sphingomyelinase (SMPD1) gene; complete 






M83822 


ceN division cvcleTlikr^ "^^^ ^ 




5 


M 86934 


DNA sT'menrnumerous co ies- ex re — d 






M87338 


re licafon'facto7(r(aTtivato^^^^ ^'ordT^'^ ^^^^^^ ^^^^ ^^"^^ 






M96326 


azur^idin I^Lationic antknicrobial rotein 7 








TIAl'gltotoxic^ran^^^ ~~ 






M98833 


FrienPl^temtrXus^rteqration^ protein-ltke 1 




10 




srrsstin 3* rotinal (X-arr6stin) 






S72370 








S78569 


laminit^alpha°4^'^^^ 






S79873 


lysosomal-associated membrane protein 2 






S83325 


aspartate beta-hydroxylase 




15 


S83364 








S83365 








U01212 


olfactory marker protein (symbol provisional) 






U01922 


translocase of Inner mitochondrial membrane 8 e t h 






U02556 


t TOmTeTL°soc!a^e(rtestire^^ ^ ^^^^^'^ ^ 




20 


U02680 


rotein rros^rkbaseT'^"^^^^^^^^ — ~ 






U03272 


fibrilHn 2fcon"enital^contractur^ arachnodact 1 






U04209 


microfibriMrasLcia^terrote^^ 






U05237 


fetarAlzL^neranti'^n 






U07225 


triner^ic^rece tor'p2Y- G rotein cou led- 2 




25 


U07620 


mrtTeTact^vater rotein kinase'l O^""*^ ^ ' 






U09759 


mitoqen^ctivated protein kinase 9 






J09820 


alpha thalass6mia/m6ntal rstardation syndrome X-linked 
















ceTrlreTrTot!!^iM17kD) 

cen romere pro em ) ___ ^ 




3 0 


U14575 








U15173 


BCL2/adenwlrurE^B 1^^^ ctin ^'^""^^ ^'^'^"'^'^ ^ 






U 15932 


dual specifirtvphosphataseT^'^^'^^'^^ ^'^"^^'^ ^ 






U18291 


CDC16 (cell division cycle 16; S. cerevisiae; homoloq) 






U18300 


damage-specific DNA binding protein 2 (48kD) 




35 


U18383 


nuclear respiratory factor 1 






U20536 


caspase 6; apoptosis-related cysteine protease 






U21551 


branched chain aminotransferase 1 : cytosollc 






U23028 


eukaryotic translation initiation factor 2B; subunit 5 (epsilon; 






U23752 


SRY (sex-determining region Y)-box 1 1 




40 


U25435 


transcriptional repressor 






U25997 


stanniocalcin 
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zinc finger protein 169 






U28M1 
















U32315 


syntaxin 3A 






U32439 


repulator of G-protein sipnalling 7 








N-myc (and STAT) interactor 








necdin (mouse) homolog 






U 36764 


eul<aryotic translation initiation factor 3; subunit 2 (beta; 36kD) 






U39400 


chromosome 1 1 open reading frame 4 




10 


U 39657 


mitogen-activated protein kinase kinase 6 








proline arqinine-rich end leucine-rich repeat protein 






y4i^4 


a disintegrin and metalloproteinase domain 9 (meltrin gamma) 






U41813 


homeo box A9 






U41815 


nucleoporin 98I<D 




15 


U43286 


selenophosphate synthetase 2 








MAD (mothers against decapentapleqic; Drosophila) homolog 4 








small nuclear RNA activating complex; polypeptide 1; 43kD 






U4701J 


fibroblast growth factor 8 (androgen-induced) 









fibroblast growth factor 8 (androgen-induced) 




20 




fibroblast growth factor 8 (androgen-induced) 






LMTori 


fibroblast growth factor 8 (androgen-induced) 






U47077 


protein kinase: DNA-activated; catalytic polypeptide 






U48251 


protein kinase C binding protein 1 






U50535 


Human BRCA2 region; mRNA seguence CG006 




25 


U56833 


von Hippel-Lindau binding protein 1 






U58091 


CUMin 






U58837 


cyclic nucleotide gated channel beta 1 






U59289 


cadherin 13; H-cadherin (heart) 






U59863 






30 


U67122 


ubi^uitin'like Hs^nWnT^"^'^^^'^ ^^^^ act'^ator 






U67319 


cas ase T a o tosis related steine rotease 






U68019 


MAD^(rnothere°a a!nst deca ente^iric-D^^^^ h'l 

(mo ersagatns ecapen pegtc, rosop t a) homolog 3 — 






U69611 


a disintegrin and metalloproteinase domain 17 (tumor necrosis 
factor; alpha; converting enzyme) 






J 70322 


karyopherin (importin) beta 2 




35 


U73524 


ATP/GTP-binding protein 






U79267 


protein phosphatase 4; regulatory subunit 1 






U79291 


Human clone 23721 mRNA seguence 






U82671 


Homo sapiens clone LM1955 H105e3 gene; partial cds 






U82671 


zinc finger protein 185 (LIM domain) 




40 


U84573 


procollagen-lysine; 2-oxoglutarate S-dioxygenase (lysine 






U90914 


carboxypeptidase D 






U91316 


cytosolic acyl coenzyme A thioester hydrolase 






□91932 


adaptor-related protein complex 3; sigma 1 subunit 





138 



wo 01/11086 



PCT/USOO/22061 





Exemplar 
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UniGeneIDd 1/29/99) 




U96131 


Homo sapiens HPV16 E1 protein binding protein mRNA; 






U9701 8 


ecliinoderm microtubule-associated protein-like 






U97188 


IGF-II mRNA-binding protein 3 






\/00503 


collagen; type 1; alpha 2 








2;3-bisptiosphoglycerate mutase 






X06389 


synaptophysin 






X07496 


apoiipoprotein A-l 






X07820 


matrix metalloproteinase 10 (stromelysin 2) 








thrombospondin 1 




10 


X15525 


acid pliosphatase 2; lysosomal 








methylene tetrahydrofolate dehydrogenase (NAD+ dependent); 
methenyltetrahydrofolate cycloiiydrolase 






XI 6609 


ankyrin 1 ; erythrocytic 






X53586 


inteqrin; alpha 6 






X53586 


integrin; alpha 6 




15 


X53793 


multifunctional polypeptide similar to SAICAR synthetase and AIR 






X54936 


placental growth factor; vascular endothelial growth factor-related 






X55740 


5' nucleotidase (CD73) 






X57025 


Insulin-like growth factor 1 (somatomedin C) 






X60673 


adenylate kinase 3 




20 


X60673 


adenylate kinase 3 






AOU/UO 


dipeptidylpeptidase IV (CD26; adenosine deaminase complexinq 








wee1+ (S. pombe) homolog 






AOOVjy / 


Rhesus blood group; D antigen 






X63563 


polymerase (RNA) 11 (DNA directed) polypeptide B (140kD) 




25 


X64037 


general transcription factor IIF; polypeptide 1 (74kD subunit) 








hect domain and RLD 2 






Aoyo I o 


fms-related tyrosine kinase 4 






X70649 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 








retinoblastoma-binding protein 7 






<74987 


ATP-binding cassette; sub-family E (OABP); member 1 






"(83107 


BIMX non-receptor tyrosine kinase 








acylphosphatase 1; erythrocyte (common) type 






X85753 


cyclin-dependent kinase 8 






X87870 


hepatocyte nuclear factor 4; alpha 




35 


X89066 


transient receptor potential channel 1 






X89398 


uracil-DNA glycosylase 






X893g8 


uracil-DNA glycosylase 






X89399 


RAS p21 protein activator (GTPase activating protein) 3 






X89426 


endothelial cell-specific molecule 1 




40 


X91247 


thioredoxin reductase 1 






K91648 


H.sapiens mRNA for pur alpha extended 3'untranslated region 
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X92098 


coated vesicle membrane protein 




X92110 


H. sapiens mRNA for hcgVIII protein 




X94703 


RAB28; member RAS oncogene family 




X96506 


DR1 -associated protein 1 (negative cofactor 2 alpha) 




X97230 


killer cell immunoqiobulin-like receptor; three domains; long 




X98263 


M-phase phosphoprotein 6 




X98296 


ubiquitin specific protease 9; X chromosome (Drosophila fat 




X99584 


SMT3 (suppressor of mif two 3; yeast) homoloq 1 




Y00264 


amyloid beta (A4) precursor protein (protease nexin-ll; Alzheimer 




Y07566 


Ric (Drosophila )-like; expressed in many tissues 




Y07759 


myosin VA (heavy polypeptide 12; myoxin) 




Y07827 


butyrophilin; subfamily 3; member A1 




Y07867 






Y09443 


— 

alkylglycerone phosphate synthase 






H. sapiens mRNA for unknown protein 




Y12394 


karyopherin alpha 3 (importin alpha 4) 




Z11559 


iron-responsive element binding protein 1 




Z11695 


mitogen-activated protein kinase 1 




Z15005 


centromere protein E (312kD) 




Z46261 


H3 histone family; member A 




AA011243 


poly(rC)-binding protein 2 




AA018418 


ESTs; Weakly similar to type-1 protein phosphatase skeletal 
muscle glycogen targeting subunit [H. sapiens] 




AA018758 


ESTs 




AA018804 


Homo sapiens clone 23675 mRNA sequence 




AA031993 






AA0442 1 7 


ESTs; Weakly similar to collagen alpha 2(1) chain [R.norvegicus] 




AA046548 


SWI/SNF related; matrix associated; actin dependent regulator of 
chromatin, subfamily e, member 1 




AA057447 


ESTs; IVIoderately similar to alternatively spliced product using 




AA058376 


Sjogren syndrome antigen A2 (60kD; ribonucleoprotein 




AA083572 


v-ral simian leukemia viral oncogene homolog A (ras related) 




AA085696 






AA088744 






AA089688 


ES? 




AA091284 


ESTs; Highly similar to HSPC030 [H sapiens] 




AA092700 


ESTs 




AA092968 


ESTs 




AA094800 


eukaryotic translation initiation factor 3; subunit 7 (zeta; 66/67kD) 




AA100219 


ESTs 




AA1 14885 


ESTs 
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UniGenelD( 11/29/99) 




AA1 29547 


met proto-oncoqene (hepatocyte qrowth factor receptor) 






AA133016 


ESTs 






AA149507 


homolog of mouse quaking QKI (KH domain RNA binding protein) 






AA151005 


sperm surface protein 




5 


AA187101 








AA195179 


eukaryotc translation initiation factor 4A; isoform 2 






AA203 1 38 


low density lipoprotein receptor (familial hypercholesterolemia) 






AA203645 


Arg/Abl-lnteractlng protein ArgBP2 






AAZUO^oD 






10 


AA227621 


ESTs; Weakly similar to weak similarity to collagens [C.elegans] 






AA248283 


ESTs; Weakly similar to prostate-specific transglutaminase 






AA24961 1 


SH3-binding domain glutamic acid-rich protein 






AA282640 


ubiquitination factor E4B (homologous to yeast UFD2) 






AA287199 


KIAA0081 protein 




15 


AA3 13990 


DKFZP564M112 protein 






AA314256 


ESTs; Highly similar to CGI-94 protein fH.sapiensl 






AA3 14389 


ADP-ribosylatlon factor-like 5 






AA324364 


ESTs 






AA329211 


NS1 -associated protein 1 




20 


AA399187 


DKFZP434A043 protein 






AA421079 


ESTs; Weakly similar to Sox-like transcriptional factor fH.sapiensl 






AA422029 


ESTs 






AA425230 


Ras-GTPase-activating protein SH3-domain-binding protein 






AA447052 


KIAA0251 protein 




25 


AA452000 


Homo sapiens mRNA; cDNA DKFZp586E1624 (from clone 






AA456687 


ESTs 






AA487015 


Homo sapiens mRNA; cDNA DKFZp586L0120 (from clone 






AB002326 


Human mRNA for KIAA0328 gene; partial cds 






-BioB-3 






30 


C01527 


Mh — : 






C01714 


serum-lnduclble kinase 






C01811 


Homo sapiens clone 24921 mRNA sequence 






C02352 


ESTs; Highly similar to CGI-121 protein FH.saplens] 






C02375 


.§§12 




35 


C14448 


EST 








coproporphyrlnogen oxidase (coproporphyria; harderoporphyria) 






D25216 


KIAA0014 gene product 






D31352 


ESTs 






D58024 


ESTs; Weakly similar to KIAA0768 protein [H.sapiens] 




40 


D80897 


KIAA1 036 protein 






D82614 


ESTs 
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UniGenelD(1 1/29/99) 




D87845 


platelet-activating factor acetyl hydrolase 2 (40I<D) 






2§23ZZ 


msh (Drosophila) homeo box homoloq 2 






H06583 


cAMP responsive element bindinci protein-iike 2 






H40732 









H46617 









H56731 









H75570 


l§Is 






H78886 









H81241 


Kruppel-iike factor 8 




10 


L36531 


Inteprin; alpha 8 






M63154 


gastric intrinsic factor (vitamin B synthesis) 






M63180 


threonyl-tRNA synthetase 






M91504 


. 






N56191 


protocadherin 68 




15 


N78483 


ESTs; Weakly similar to F20D12.3 qene product [C.eleqansl 






N79268 


zinc finger protein 198 






R14652 


Homo sapiens PAC clone DJ0872F07 from 7q31 






R20459 


ESTs __ 






R22303 


ESTs; Weakly similar to putative p150 [H.sapiens] 




20 


R33779 


ESTs: Weakly similar to p40 [H.sapiens] 






R36553 


ESTs; Weakly similar to KIAA0681 protein TH.sapiens] 






R64534 


ESTs 






R66475 


ESTs 






R70621 


KIAA0896 protein 




25 


R79356 


KIAA0544 protein 






R84933 


ESTs 






AA007160 


Homo sapiens mRNA; cDNA DKFZp564D016 (from clone 






AA007234 


ESTs 






AAO 18409 






30 


AA025351 


ESTs 






AA027168 


KIAA0955 protein 






AA027317 








AA029423 


ESTs; Weakly similar to PUTATIVE PRE-MRNA SPLICING 
FACTOR RNA HELICASE [H .sapiens] 






AA031357 


ESTs; Weakly similar to N-WASP fH.sapiensl 




35 


AA045136 


ESTs 






AA053400 


ESTs 






AA055829 


ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING 






AA065217 


ESTs 






^116054 


ESTs; Weakly similar to KIAA0638 protein [H .sapiens] 




40 


AA1 26311 


ESTs 






AA1 29390 


ESTs 






^A1 30273 


ESTs; Weakly similar to hypothetical protein; similar to 





142 



wo 01/11086 



PCT/USOO/22061 





Exemplar 
Accession 
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AA142919 


ESTs 






AA1 50205 


Kruppel-like factor 7 (ubiquitous) 






AA1 76867 


ESTs 






AA1 80321 


Homo sapiens (clone S164) mRNA; 3' end of cds 




5 


AA1 80487 


transforming; acidic coiled-coil containing protein 1 






AA 187634 


eukaryotic translation initiation factor 3; subunit 1 (alpha; 35I<D) 






AA1 95399 


ESTs 






AA234717 


ESTs 






AA234743 


ESTs 




10 


AA234957 


myotubularin related protein 1 






AA235604 


Homo sapiens clone 25007 mRNA sequence 






AA236559 


ESTs; Weakly similar to Ml! ALU SUBFAMILY SQ WARNING 






AA242868 


ESTs; Weakly similar to house-keeping protein fM.musculusI 






AA251776 


iun D proto-oncogene 




15 


AA251909 


budding uninhibited by benzimidazoles 1 (yeast homolog); beta 






AA252672 


diptheria toxin resistance protein required for diphthamide 
biosynthesis (Saccharomyces]-like 2 






AA256157 


ESTs 






AA256680 


Homo sapiens mRNA; cDNA DKFZp564H1916 (from clone 






AA258873 


ESTs 




20 


AA262727 


KIAA1 033 protein 






AA281451 


DKFZP564A043 protein 






AA281545 


nuclear receptor co-repressor 1 






AA282069 


KIAA0603 gene product 






AA283044 


ESTs 




25 


AA283930 


ESTs 






AA284755 


CDW52 antigen (CAIVIPATH-1 antigen) 






AA291268 


DKFZP586L0724 protein 






AA291927 


ESTs 






AA343514 


ESTs 




30 


AA398109 


ESTs 






AA405737 


ESTs 






AA406610 


ESTs 






AA411465 


ESTs; IVIoderately similar to HMG-box transcription factor 






AA416886 


Homo sapiens mRNA; cDNA DKFZp564C1563 (from clone 




35 


AA424013 


Homo sapiens clone 23767 and 23782 mRNA sequences 






AA424148 


DKFZP434I1 16 protein 






AA424558 


phosducin-like 






AA424961 


similar to S. cerevisiae SSM4 






AA425367 


ESTs 




40 


AA425921 


ESTs 






AA426220 


KIAA0523 protein 
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AA427735 


ESTs 






AA430673 


ESTs 






AA432248 


ESTs 






AA4358g6 


ESTs 




5 


AA436705 


KIAA0766 pene product 






AA446561 


KIAA0470 gene product 






AA448238 


KIAA091 5 protein 






AA448688 


ESTs; Weakly similar to KIAA0638 protein [H .sapiens] 






AA449756 


ESTs; Weakly similar to II!! ALU SUBFAMILY J WARNING 




10 


AA450303 


ESTs 






AA45241 1 


ESTs; Highly similar to mediator [H .sapiens] 






AA454566 


hemoglobin; qamma G 






AA454667 


ESTs 






AA456437 


ESTs 




15 


AA456646 


ESTs 






AA456826 


ESTs 






AA456981 


ESTs 






AA458959 


ESTs 






AA459950 


ESTs 




20 


AA460449 


ESTs; Highly similar to phosphoserine aminotransferase 






AA463910 


ESTs 






AA464603 


ESTs 






AA464606 


MRS1 protein 






AA465093 


TIA1 cytotoxic granule-associated RNA-binding protein 




25 


AA465692 


KIAA0648 protein 






AA476473 


triple functional domain {PTPRF interacting) 






AA478109 


ESTs 






AA478474 


ESTs 






AA480889 


ESTs 




30 


AA485223 


ESTs 






AA485254 


ESTs 






AA486183 


ESTs; Weakly similar to similar to oxysterol-binding proteins 






AA496936 


ESTs 






AA598589 


ESTs 




35 


AA598831 


ESTs 






AA600150 


ESTs 






AA608545 


RAD51 {S. cerevisiae) homolog (E coli RecA homoloq) 






AA609210 


ESTs 






AA610108 


ESTs; Highly similar to CGI-124 protein [H.sapiensl 




40 


AA620582 


ESTs; Weakly similar to KIAA0869 protein [H.sapiens] 






AA621239 


ESTs; Highly similar to ALG-2 interacting protein AIP1 






AA621714 


ESTs 
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AA621718 


ESTs; Moderately similar to CGI-74 protein fH.sapiens] 






D19673 


ESTs 






D25755 











DKFZP586E1621 protein 






D60272 


ESTs; Weal<ly similar to macrophage lectin 2 [H.sapiens] 






108879 


cathepsin F 






T34527 


UDP-N-acetyl-alpha-D-galactosamine: polypeptide 
N-acetylgalactosaminyltransferase 1 (GalNAc-TI) 






T40327 


lung resistance-related protein 






T62771 


nucleophosmin/nucleoplasmin; 3 




10 


T63174 


Homo sapiens mRNA; cDNA DKFZp586l0324 (from clone 






T83444 


KIAA0887 protein 






T93641 


ESTs 






U48263 


prepronociceptln 






1149065 


interleul<in 1 receptor-like 2 




15 


U79300 


Human clone 23629 mRNA sequence 






IJ88573 









U93867 


polymerase (RNA) III (DNA directed) (62kD) 






W01094 


ESTs 






WO 1568 


ESTs 




20 


W26853 


cartilage oligomeric matrix protein (pseudoachondroplasia; 
epiphyseal dysplasia 1; multiple) 






W27179 


BCL2/adenovirus E1B 19kD-jnteractinq protein 3-like 






W27965 


EST 






W36280 


NS1 -associated protein 1 






W47063 


ESTs 




25 


W79060 


isocitrate dehydrogenase 2 (NADP+); mitochondrial 






W88550 


KIAA1058 protein 






X60486 


H4 histone family; member G 






X78931 


zinc finger protein 272 






Z14077 


YY1 transcription factor 




30 


AA002147 


EST 






AA00471 1 


ESTs 






AAO 10383 


EST 






AA01 5761 


i^^i 






AAUlo/ 1 


§ 






AA021473 


i§I 






AA024835 


potassium voltage-gated channel; delayed-rectifier; subfamily S; 






AA025858 


Homo sapiens mRNA; cDNA DKFZp586B1024 (from clone 






AA027229 


ESTs; Weakly similar to F45E12.5 FCelegans] 






AA029428 


ESTs 




40 


AA035143 


ESTs 






AA035237 


butyrate response factor 2 (EGF-response factor 2) 
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AA039347 


.§§1 






AA040740 


_ESTs 






AA041551 


.§§15 






AA045513 


_ESTs 






AA045745 


ESTs 






AA055348 


l§]i 






AA056582 


KIAA0372 gene product 






AA056697 


ESTs 






AA056746 






10 


AA057678 


— 

ESTs 






AA058681 


ESTs 






AA058686 


ESTs 






AA062840 








AA064859 






15 


AA065069 








AA069923 








AA070799 


zinc finger protein 6 (CMPX1 ) 






AA070815 








AA075374 








AA076382 








AA078787 


ESTs 






AA078986 








AA079393 








AA079487 






25 


AA083207 


EST 






AA083256 








AA084415 








AA085274 








AA088678 


ESTs 




30 


AA1 00925 


stress-associated endoplasmic reticulum protein 1 ; ribosome 
associated membrane protein 4 






AA101255 


Homo sapiens mRNA for H-2K binding factor-2; complete cds 






AA 126474 








AA1 27017 


ESTs 






AA1 29968 


ESTs; Weakly similar to PROTEIN PHOSPHATASE PP2A; 130 
KD REGULATORY SUBLIMIT fh. sapiens] 




35 


AA 130240 


_ESTs 






AA131866 


ESTs; Weal<ly similar to DY3.6 fC.elegans] 






AA1 32039 








AA1 32983 


DKFZP586G1 51 7 protein 






AA1 33250 


ESTs 




40 


AA133583 


high-mobility group (nonhistone chromosomal) protein isoform l-C 






AA135941 


ESTs 






AA148650 
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AA151110 


ESTs 






AA1 55754 


EST 






AA156125 


ESTs; Moderately similar to hedgehog-Interacting protein 
[M.musculus] 






AA1 56289 


ESTs 






AA1 56997 








AA 157291 


EST 






AA 157293 


ESTs 






AA1 64293 


ESTs 






AA1 64676 


ESTs; Weakly similar to weak similarity to S. cerevisiae 
intracellular protein transport protein US)1 [C.elegans] 




10 


AA1 67375 


KIAA0530 protein 






AA1 67550 


Homo sapiens mRNA; cDNA DKFZp564i\^1 1 3 (from clone 






AA1 76589 








AA1 80448 


— 






AA187144 


— \ 

endothelin 1 




""■^ 


AA189170 


ESTs 






AA1 92757 


ESTs 






AA205650 


ESTs 






AA233342 


ESTs; Weakly similar to WD40 protein Ciao 1 [H.sapiens] 






AA233472 


ESTs 






AA234110 


ESTs 






D80981 


ESTs 






F01660 


^^"""^ — : : 






F02206 


EST; Highly similar to ether-a-go-go-related protein [H.sapiens] 






F02208 


ESTs 




25 


F02544 


ESTs 






F03918 


ESTs ^ ^ 








pyrophosphatase (inorganic) 






F04600 


ESTs 






F08998 


ESTs 




30 


F09605 


ESTs 






F11115 


ESTs 






H06371 


Homo sapiens clone 24993 mRNA sequence 






H 1 0995 


Homo sapiens mRNA full lenqth insert cDNA clone EUROIMAGE 






j11938 


ESTs; Highly similar to histone acetyltransferase [H .sapiens] 




35 


j16568 








H16772 


ES? 






HI 8951 


to 1 s, ivioaeraieiy similar xo uj i iooj i .1 in.sapiens] 






H20859 


ESTs 






H23747 


ESTs 




40 


H38087 


ESTs; Weakly similar to NG22 [H.sapiens] 






H40331 


ESTs 






H40567 


ESTs 





147 



wo 01/11086 



PCT/USOO/22061 





Exemplar 
Accession 


Complete Title 


UniGenelD(1 1/29/99) 




H46966 








H56640 


ESTs 






H57154 


ESTs; Weakly similar to organic anion transporter 1 [H .sapiens] 






H96712 


ESTs 




5 


N20814 


ESTs 






N25249 


synaptosomai-associated protein; 23kD 






N27100 


keratin 5 (epidermolysis bullosa simplex; 






N39616 


RNA (guanine-7-) methyltransferase 






N48982 


I§IJ 




10 


N51957 


■^^^^ — : — : — \ — : 






N52271 


LIM protein (similar to rat protein kinase C-binding enigma) 






N59435 


ESTs; Highly similar to CGI-112 protein [H.sapiens] 






N64139 


ESTs; Weakly similar to large tumor suppressor 1 [H.sapiens] 






N66981 


ESTs 




15 


N68640 


ESTs 






N69352 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 15 






N95226 


KIAA0758 protein 






R00138 


ESTs 






R07998 


ESTs; Weakly similar to Ml! ALU SUBFAMILY J WARNING 




20 


R08929 


ubicjuitin-conjugsting dnzyms E2G 2 (homologous to ydsst UBC7) 






R10307 








R33354 


ESTs 






R36083 


ESTs 






R37938 


KIAA0440 protein 




25 


R39330 








R40816 


cullin 4A 






R43162 


ESTs 






R45698 


ESTs; Weakly similar to cAMP inducible 2 protein [M.muscuius] 






R54554 


ESTs 




30 


R68425 


ESTs 






R6856a 


ATX1 (antioxidant protein 1: yeast) homolog 1 






R68763 


ESTs 






R70467 


ESTs 






R73565 


Homo sapiens mRNA; cDNA DKFZp564l\1113 (from clone 




35 


R73640 


ESTs 














R92453 


EST 






T03865 


ESTs 






T03872 


ESTs 




40 


T10072 


ESTs 






T10080 


ESTs 






T10132 


KIAA0478 gene product 





148 
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Accession 


Complete Title 


UniGenelD(1 1/29/99) 












T23457 


est! 








T23555 


est! 






T23670 


EST^ 




5 


T23948 


ESTs 






T33464 


ESTs 






T34413 


ESTs 






T34611 


ESTs 






T40920 


ESTs 




10 


T55182 


ESTs; Highly similar to IGF-II mRNA-bindinq protein 2 [H. sapiens) 














T84039 




ESTs 

—J 









5 






T87693 








1 0900U 


— 








est! 






T90987 


ESTs 






T91863 


ESTs 






T91881 


KIAA0563 pene product 




20 


T93783 


ESTs 






T96687 


ESTs 






T96944 


Homo sapiens mRNA; cDNA DKFZp434H132 (from clone 






T97307 


ESTs; Moderately similar to III! ALU SUBFAMILY J WARNING 






T97764 


ESTs 




25 


W48817 


ESTs 






W58343 


DKFZP586B2420 protein 






W59949 


ESTs; Moderately similar to GTP-BINDING PROTEIN TC10 






W74644 


ESTs 






VV / 4 / 0 1 


ESTs; Highly similar to ubiquitin-conjuqating enzyme HBUCE1 




30 


W74802 








W81 205 


est! 






A/8 1 237 


ESTs 






W90146 


ESTs 






W92798 


ESTs 




35 


Z38412 


EST 






Z38709 


inositol 1 ;4;5-triptiosptiate receptor; type 2 






Z38904 


ESTs; Weakly similar to KIAA0970 protein [H.sapiens] 






Z39103 


core-binding factor; runt domain; alpha subunit 2; translocated to; 






Z39930 


calreticulin 




40 


Z39939 


ESTs 






Z40012 


NCK-associated protein 1 






Z40377 


ESTs 





149 
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PCTAJSOO/22061 



Exemplar 
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Complete Title 


UniGenelD(1 1/29/99) 


Z40820 


ESTs 







Homo sapiens mRNA; cDNA DKFZp566P013 (from clone 










AA005112 


LIM domain only 7 




AA005432 


DKFZP547E2110 protein 




AA010163 


upstream regulatory element binding protein 1 




AA026356 


ESTs 




AA026901 


ESTs 




AA036867 


ESTs; Weal<ly similar to coded for by C. eleqans cDNA ykSObS 5 




AA044644 


lymphocyte-specific protein 1 




AA046426 


Cdc42 effector protein 3 




AA05451 5 


ESTs; Weakly similar to prostate-specific transglutaminase 




AA084162 






AA085749 


ATP binding protein associated with cell differentiation 




AA098874 


DKFZP434I116 protein 




AA101056 






AA102746 


ESTs 




AA1 14250 


KIAA0512 qene product 




AA126561 


stanniocalcin 




AA1 28980 


ESTs 




AA1 29757 


ESTs; Weakly similar to 60S RIBOSOMAL PROTEIN L22 




AA1 29921 


S-adenosylhomocysteine hydrolase-like 1 




AA1 33331 


KIAA0741 qene product 




AA1 35958 


ESTs 




AA1 36524 


eukaryotic translation elongation factor 1 alpha 1 




AA 147044 


ESTs; Weakly similar to ALU CLASS C WARNING ENTRY 




AA1 48885 


minichromosome maintenance deficient (S. cerevisiae) 4 




AA1 50043 


^^^^^5 




AA151621 


^^"""^ 





AA1 55743 


ferritin; light polypeptide 




AA1 56335 


^ — 




AA1 56336 


nuciear receptor CO- 'e :•• ■ • i 




AA159181 


ESTs; Weakly similar to LpaSp [S.cerevisiael 




AA1 59825 


ESTs; Weakly similar to ORF YNL227C rs.cerevisiael 




AA234185 


ESTs 




AA234929 






AA234935 


ESTs 




AA236359 


ESTs 




AA236466 


ESTs 




AA236535 


Human clone 23654 mRNA sequence 




AA236935 


Human normal keratinocyte mRNA 




AA236942 


ESTs 1 
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Exemplar 


Complet6 Title 


• 

UniGenelD(1 1/29/99) 




AA237018 


ESTs 






AA237025 


ESTs 






AA242751 


KIAA0903 protein 






AA242760 






5 


AA242763 


CDC14 (cell division cycle 14; S. cerevisiae) homolop B 






AA242809 


ESTsi Weskly simil3r to !!!! ALU SUBFAMILY J WARNING 






AA2431 33 


serine/threonine kin3S6 15 






AA243495 








AA243706 


^ctin, mannose-binding, 1 




10 


AA2 50848 


ESTs 






AA250868 


ESTs 

! 






AA251 1 52 










EST^ 






AA251792 


fatty-acid-Coenzyme A ligase; long-chain 4 




15 


AA252063 


BH-protocadherin (brain-lieart) 






AA252144 


ESTs 






AA252524 








AA253461 


ESTs 






AA255522 


ESTs; Weskly similsr to INHIBITOR OF APOPTOSIS PROTEIN 1 




20 










AA256528 


ESTs 






AA257976 


ESTs 






AA258296 


KIAA0579 protein 






AA258409 


myelin protein zero-like 1 




25 


AA258421 


liypotl^etical protein 






AA262077 


aldeiiyde deliydropenase 5 family; member A1 






AA278650 


ESTs; Weal<ly similar to similar to the beta transducin family 






AA278766 


ESTs 






AA279667 


natural killer-tumor recognition sequence 




30 


AA280791 


eul<aryotic translation initiation factor 5 






AA280819 


MADS box transcription enhancer factor 2; polypeptide C 






AA280828 


Homo sapiens mRNA; cDNA DKFZp586M141 (from clone 






AA282195 


ESTs; Weakly similar to Unknown [H. sapiens] 






AA283127 


l-iomo sapiens clone LM1955 H105e3 gene; partial cds 




35 


AA284694 


nucleoporin-like protein 1 






AA291137 


ESTs 






AA291708 


ESTs; Weakly similar to !!!! ALU SUBFAMILY SQ WARNING 






AA293495 


chromosome 8 open reading frame 1 






AA347193 


ESTs 




40 


AA398474 


Homo sapiens mRNA; cDNA DKFZp586H051 (from clone 






AA398512 


ESTs 
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Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 




AA400277 


ESTs 






AA400896 


ESTs 






AA404494 


CTP synthase 






AA410345 


ESTs; Weakly similar to junctional adhesion molecule fH.sapiens] 




5 


AA4 16733 


ESTs; Weal<ly similar to II!! ALU SUBFAMILY SC WARNING 






AA425154 


ESTs 






AA426573 


ESTs; IVIoderatelv similar to endomucin fiVl.musculus] 






AA431418 


N-acetylqlucosaminidase; alpha- (Sanfilippo disease IIIB) 






AA436182 


Human DNA sequence from clone 44A20 on chromosome 
6q23. 1-24.3. Contains a gene for a novel protein similar to 
MTHFD1 (methylenetetrahydrofolate dehydrogenase (NADP+ 
dependent); methenyltetrahydrofolate cyclohydrolase; 




10 


AA437099 


ESTs 






AA446585 


ESTs 






AA446887 


ESTs 






AA447224 


... ., 

ESTs; Weal<ly similar to cDNA EST CEESW54F comes from this 






AA447709 


ESTs; IVIoderately similar to putative transcription factor CA150 




15 


AA453624 


deoxynucleotidyltransferase; terminal 






AA455044 


ESTs 






AA456045 


ESTs 






AA460454 


ESTs; Weakly similar to KIAA0512 protein fH.sapiens] 






AA476494 


ESTs; Weakly similar to KIAA0512 protein rH.sapiens] 




20 


AA476738 


leucine rich repeat (in FLU) interacting protein 1 






AA481422 


Homo sapiens mRNA for H-2K bindinq factor-2; complete cds 






AA482269 


integral membrane protein 1 






AA482595 


ESTs; Weakly similar to F25B5.3 fCeleqansl 






AA485084 


ESTs 




25 


AA485431 


ESTs 






AA489057 


stromal antigen 2 






AA489638 


DKFZP564IVI2423 protein 






AA491000 


Homo sapiens mRNA: cDNA DKFZp586N1720 (from clone 






AA491250 


ESTs 




30 


AA505133 


solute carrier family 2 (facilitated glucose transporter); member 3 






AA598447 


exportin; tRNA (nuclear export receptor for tRNAs) 






AA599243 


general transcription factor IIIA 






AA599574 


lipase; endothelial 






AA600153 


DEK oncogene (DNA binding) 




35 


AA609309 


ESTs; Weakly similar to l!!l ALU SUBFAMILY J WARNING 






AA609710 


Human chromosome 3p21.1 gene sequence 






AA610068 


PIBF1 gene product 






AA621399 


ESTs 






/\A621752 


26S proteasome-associated pad1 homolog 
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Accession 


Complete Title 


UniGenelD(1 1/29/99) 




C21523 


ESTs 






D12160 


ESTs; IVIoderately similar to III! ALU SUBFAIVIILY J WARNING 






D19708 


ESTs 






D25801 


ESTs; Highly similar to KIAA0445 protein fH.saplensl 




5 


D45652 


a disintegrin-like and metalloprotease (reprolysin type) with 
thrombospondin type 1 motif; 4; aggrecan 1 






D60208 


ESTs 






D80504 


zinc finger protein 1 98 






F03010 


myeloid/lymphoid or mixed-lineaqe leukemia 2 






F04247 


.§§12 




10 


F10966 


Homo sapiens mRNA; cDNA DKFZp434M196 (from clone 






F13700 


ribonuclease P; 40kD subunit 






H05063 


ESTs; Weakly similar to /prediction 






H 16758 


eyl^ ropoietin receptor 






H17315 






15 


H22556 


utative translation initiation factor 






H22566 


pua^ve ransaion nitiatton actor 






H48459 


KIAA0186 QGn6 product 






H53073 










KIAA0601 protsin 




20 


H57957 








H 64938 


ES^ " 






H64973 


"isTs ' 








"eSTs ~ 






H73110 


"eSTs 




25 


H81783 


ESTs ' 






H86259 


Homo sa iens chromosome 1 co mi 

omo sapiens c romosome 19; cosmid R32611 — _ 








ESTs; Weskly similsr to Iin0-1 protQin 0RF2 [H.sspiGns] 






H88639 


YY1-associ3ted factor 2 






rl88675 






30 




sperm speci ic antigen 2 






M22107 








VI24046 


ESTs 






M27028 


ESTs ~~ 






N30205 


ESTs 




35 


N30621 


ESTs 






N33258 


nuclear receptor co-repressor 1 






N33390 


EST 






N40180 


EST 






N45198 


EST; Highly similar to similar to Cdc14B1 phosphatase 
H. sapiens! 




40 


N45979 


SH3 domain protein IB 






N48325 


EST 
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Accession 


Complete Title 


UniGeneIDd 1/29/99) 




N48913 


ESTs 






N49394 


KIAA0716 gene product 






N50656 


ESTs; Highly similar to mosaic protein LR11 [H.sapiens} 






N50721 


signal sequence receptor; gamma (translocon-associated protein 




5 


N53143 


Homo sapiens clone 25218 mRNA sequence 






N53359 


ESTs; Weakly similar to beta-TrCP protein E3RS-ll<appaB 






N55326 


ESTs 






N55493 








N57493 


EST 




10 


N62955 


ESTs; Weakly similar to KIAA0396 [H.sapiens] 






N63520 


EST 






N63604 


ESTs 






N64166 


frizzled (Drosophlla) homolop 7 






N64168 


ESTs 




15 


N64191 


ESTs 






N66845 


ESTs; Weakly similar to II!! ALU CLASS B WARNING ENTRY !!!! 






N67135 


ESTs 






N67295 


ESTs 






N68399 


H2B histone family; member N 




2 0 


N68963 


ESTs 






N69331 


peptidylprolyl isomerase C (cyclophilin C) 






N70777 


ESTs 






N71364 


ESTs 






N71545 


ESTs 




25 


N71571 


ESTs 






N 74456 


EST 






N75594 


ESTs 






N79035 


ESTs 






N80279 


hypothetical protein 




30 


N91797 








N92454 


l<aryopherin (importin) beta 1 






N94581 


actin; beta 






N94746 


ESTs 






N98238 


ESTs 




35 


R02384 


ESTs 






^16833 


ESTs; Weakly similar to ill! ALU SUBFAMILY J WARNING 
ENTRY !!!! fH.sapiensl 








myosin VA (heavy polypeptide 12; myoxin) 






R43203 


EST 






R46395 


DKFZP566A0946 protein 




40 


R58863 


ESTs 






R78248 


ESTs; Weakly similar to KIAA0970 protein fH.sapiens] 






T11483 


ESTs 
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Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 




T 16896 


ESTs 






T23820 


cyclin T2 






T30222 


ESTs; Moderately similar to tetracycline transporter-lil<e protein 






W15275 


Homo sapiens mRNA; cDNA DKFZp586E1624 (from clone 




5 


W38194 








W42414 


MAD (mothers against decapentaplegic: Drosophila) homolog 3 






W46577 


endothelial cell-specific molecule 1 






W49632 


Human clone 23908 mRNA sequence 






W57613 


ESTs 




10 


W57759 


EST 






W61118 


ESTs 






W65344 


ESTs; Moderately similar to hypothetical protein [H.sapiens] 






W69216 


ESTs 






W69379 


Homo sapiens mRNA; cDNA DKFZp586D0923 (from clone 




15 


W86728 


ESTs 






Z38499 


MKP-1 like protein tyrosine phosphatase 






Z38630 


bladder cancer related protein (lOkD) 






Z39494 


ESTs 






Z39623 


ESTs 




20 


Z40071 


BMX non-receptor tyrosine kinase 






Z40174 


ESTs 






Z40182 


I§I . 






Z40904 


EST 






AA1 66965 


ESTs 




25 


AAIO/OUU 


I§I 






AA 169599 


^^^^^ "~ 






A A1 7-1 704 


ESTs; W6dkly similar to ORF YNL059C fS.c6r6visia6l 






AA171739 









AA177105 


ESTs; Weakly similar to MITOCHONDRIAL 
CARNITINE/ACYLCARNITINE CARRIER PROTEIN fH. sapiens] 




3 0 


AA1 82626 


ESTs 






AA1 86324 


cell cycle progression 8 protein 






AA 192099 


zinc finger protein 148 (pHZ-52) 






AA192173 


ESTs 






AA192415 


ESTs 




35 


AA1 92553 


ESTs; Highly similar to RGC-32 [R.norvegicusl 






AA 194851 


ESTs 






AA1 95520 


ESTs 






AA 196300 


ESTs; Weakly similar to alternatively spliced product using exon 






AA196517 


protease; serine; 15 




40 


AA1 96549 


ESTs 






AA1 96721 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/99) 




AA1 96729 


ESTs 






AA1 96979 


ESTs; Weakly similar to protease FH.sapiensl 






AA206828 








AA207123 


immunoglobulin superfamily; member 3 




5 


AA214539 


TIA1 cytotoxic granule-associated RNA-blndinq protein 






AA226914 


nuclear receptor subfamily 2; group C; member 1 






AA227260 


Zic family member 3 (odd-paired Drosoptiila homoloc); heterotaxy 






AA227469 


EST 






AA233122 


ESTs; Highly similar to multifunctional 
E E — '. 




10 


AA233334 


Machado-Joseph disease (spinocerebellar ataxia 3; 
olivopontocerebellar ataxia 3; autosomal dominant; ataxin 3) 






AA233347 


zinc finger protein 216 






AA233519 


ESTs; Weakly similar to evectin-1 [R.norveqicusl 






AA233714 


Apg12 (autophagy 12; S. cerevlsiae)-like 






AA233796 


eukaryotic translation initiation factor 4E 




15 


AA235050 


ESTs 






AA235704 


ESTs; Weakly similar to WIscott-Aldrich Syndrome protein 






AA236031 


ESTs 






AA236352 


ESTs 






AA236390 


ESTs 




20 


AA236453 


ESTs 






AA243370 


EST 






AA250947 


ESTs 






AA251083 


ESTs 






AA251113 


ESTs 




25 


AA251973 


ESTs 






AA252023 


ESTs; Weakly similar to HRIHFB2157 rH.sapiensl 






AA252414 


ESTs 






AA252650 


mitogen-activated protein kinase kinase 7 






AA255523 


ESTs 




30 


AA258128 


ESTs 






AA262105 


Homo sapiens mRNA; cDNA DKFZp564L1916 (from clone 






AA262107 


ESTs 






AA262235 


ESTs 






AA278298 


M-phase ptiosphoprotein 1 




35 


AA278529 


serine/threonine kinase 18 






AA278721 


ESTs 






AA280036 


eukaryotic translation Initiation factor 4A; Isoform 2 






AA280648 


ESTs: Weakly similar to rab-related GTP-blndlnq protein 






AA280738 


ESTs 




40 


^A280794 


ESTs 1 
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Accession 


Complete Title 


UniGenelD(1 1/29/99) 




AA280837 


ESTs 






AA280886 


ESTs 






AA280934 


ESTs 






AA281535 


KIAA0879 protein 




5 


AA281797 


general transcription factor IIH: polypeptide 2 (44kD subunit) 






AA282047 


ESTs 






AA283002 


zinc finger protein 187 






AA283709 


calpain like protease 






AA283902 


ESTs 




10 


AA284108 


Human DNA from ctiromosome 19-speclflc cosmid F25965; 






AA284109 


Human DNA sequence from clone 71 L16 on chromosome Xpl 1 . 
Contains a probable Zinc Finger protein (pseudo)gene; an 
unknown putative gene; a pseudogene with high similarity to part 






AA284371 


interleukin 13 receptor; alpha 1 






AA284744 


ESTs; Highly similar to prefoldin subunit 2 [M.musculus] 






AA284784 


mitochondrial ribosome recycling factor 




15 


AA284840 


ESTs 






AA286844 


ESTs 






AA287032 


ESTs 






AA287038 


ESTs 






AA287546 


ESTs 




20 


AA287553 


ESTs 






AA287556 


ESTs; Weakly similar to III! ALU CLASS B WARNING ENTRY III! 






AA287564 


!DN3 protein 






AA291015 


CDC7 (cell division cycle 7; S. cerevisiae; homoloq)-like 1 






AA291716 


ESTs 




25 


AA291749 


estrogen receptor 1 






AA293656 


ESTs 






AA302430 


Human DNA sequence from clone 141H5 on chromosome 
Xq22.1-23. Contains parts of a novel Chordin LIKE protein with 
von Wiliebrand factor type C domains. Contains ESTs; STSs and 






AA302809 


EST 






AA302820 


purinergic receptor P2X; ligand-gated ion channel; 4 




30 


AA3 10499 


ESTs 






AA321890 








AA340589 


EST 






AA340622 


ESTs 






AA342457 


ESTs; Moderately similar to III! ALU SUBFAMILY SQ WARNING 




35 


AA342828 


glycoprotein V (platelet) 






AA342864 


ESTs 






AA342973 


ESTs 






/\A346495 


ESTs 






/\A347573 


fibronectin leucine rich transmembrane protein 2 
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wo 01/11086 



PCTAJSOO/22061 



10 



20 



25 



30 



40 



Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 


AA347614 


ESTs 




AA347717 


ESTs 




AA348913 


ESTs 




AA349647 


EST 




AA349773 


ESTs 




AA350541 


ESTs 




AA357159 


EST 




AA357172 


ESTs 




AA369856 


vacuolar protein sorting 41 (yeast homolog) 




AA370132 


EST 




AA370472 


ESTs 




AA370867 


ESTs 




AA377296 


ESTs 




AA383902 


ESTs; Weal<ly similar to !!!! ALU SUBFAMILY J WARNING 




AA385934 


EST; Highly similar to predicted using Genefinder |G.eleqansl 




AA386255 


EST 




AA386260 


EST 




AA386266 


ESTs; Weakly similar to IVI6a riH.sapiens] 




AA398014 


ESTs 




AA398222 


ESTs 




AA398235 


ESTs 




AA3g8348 


ESTs 




AA398482 


EST 




AA398504 


ESTs 




AA398505 


ESTs 




AA398507 


nucleoporin 50kD 




AA398523 


ESTs 




AA398625 


ESTs 




AA398632 


ESTs 




AA398633 


ESTs 




AA398894 


ESTs 




AA398895 


EST 




AA398900 


ESTs 




AA398904 


EST 




AA399122 


ESTs; Weakly similar to mitochondrial citrate transport protein 




AA399371 


ESTs; Weakly similar to zinc finger protein SALL1 [H.sapiens] 




AA399373 


ESTs; Highly similar to KIAA0568 protein [H.sapiens] 




AA399441 






AA399636 


ESTs 




AA399640 


ESTs 




AA399680 


ESTs 




AA400080 


ESTs 




AA400262 


ESTs 
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Accession 
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UnlGenelDd 1/29/99) 




AA400725 


ESTs 






AA400748 


Homo sapiens mRNA: cDNA DKFZp434D024 (from clone 






AA400780 


ESTs 






AA401631 


ESTs 




5 


AA401688 


ESTs 






AA401695 


EST 






AA402227 


ESTs: Weakly similar to TROPOMODULIN [H. sapiens] 






AA402329 


phosphodiesterase 4A; cAMP-specific (dunce 
(Drosophila)-homolog phosphodiesterase E2) 






AA402398 


ESTs 




10 


AA402449 


1§I 






AA402468 


.§§15 






AA403268 









AA403314 


I§l5 






AA404229 






15 


AA404260 


— 

I§]J 






AA404271 


glutamate receptor; ionotropic; kainate 1 






AA405026 


.§§Is 






AA405182 


I§]j 






AA405237 






20 


AA406061 


— 






AA406063 









AA406070 


I§I 






AA4061 37 


I§I 






AA406335 


Mil 




25 


AA41 1801 


KIAA0307 gene product 






AA4 1 1 804 


^^^^ — 






AA41 1833 


ESTs; Highly similar to Trad [IH. sapiens] 






AA412219 


ME! 






AA4 1 2259 







30 


AA41 2497 








AA4 1 2498 


"est 






AA4 16586 


ESTs 






AA4 16867 


EST 






AA4 16874 


ESTs 




35 


AA421133 


ESTs 






AA421138 


EST 






AA422079 


ESTs; Wsakly similar to RAR-RESPONSiVE PROTEIN TiG1 






AA423837 


ESTs 






AA424328 


ESTs 




40 


AA424339 


ESTs 






AA424469 


ESTs 






AA424502 


ESTs 
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wo 01/11086 



PCTAJSOO/22061 





Exemplar 


ComplGtG TitiG 


UniGenelD(1 1/29/99) 




AA425004 


'■ '. : 






AA425734 


ESTs; Weakly similar to hypothetical protGin [H.sapiGns] 






AA425887 








AA426456 


est! ' 






AA427396 


ESTs " 

__i . 






fVVtiL 1 ooo 


KIAA0203 Qene product 








^^^^ . : 






AA428242 


transcription factor 9 (binds GC-rich sequences) 






AA428281 






10 


AA428865 


E^ ' 






AA4 28994 


ESTs ~ 






AA4 29666 


EST^ 






AA4301 81 


EST^ ~ 






AA430184 


^■j.p^g.^.p ^.^^.^ — 




15 


AA431288 


— " '" .'"^ — : 

CD3D antipGn; delta polypeptide (TiT3 complex) 






AA431293 








AA431478 


ESTs ^ 






AA431492 


EST^ ' 






AA431732 


EST " 




20 


AA432278 


ESTs """ 






AA43441 1 


ESTs ~~ 






AA435512 


ESTs 






AA435698 


ESTs ^ 






AA43571 1 


KIAA0712 ene roduct 




25 


AA4358 1 5 


Clk-associatinQ RS-cyclophilin 






AA435842 


ESTs 






AA436475 


ESTs 






AA436489 


ESTs 






AA442060 


ESTs 




30 


AA442079 


ESTs 






AA443151 


ESTs; Weakly similar to weak similarity with Quinone 






AA446133 








AA447145 


Homo sapiens KIAA0399 mRNA; partial cds 






/^4473gg 






35 


AA447643 


ESTs 

- — ^- — • : \ 






/\A447742 


dynein; axonemali heavy polypeptide 17-like 






'\A448226 








(\A448825 


EST " 






i^A449444 


ESTs 




40 


W50087 


regulator of Gz-selective protein siqnalinq 






^A450211 


EST 






W50244 


ESTs 






V\452123 


ESTs; Weakly similar to T-complex protein 10A fH-saplens] 






!\A452155 


zinc finger protein 198 | 





160 



wo 01/11086 



PCTAJSOO/22061 





Exemplar 
Accession 


Complete Title 


UniGenelD(1 1/29/99) 




AA452156 


EST 






AA453036 


ESTs; Weakly similar to similar to molybdoterin biosynthesis 






AA453526 


.§§]^ 






AA454085 








AA454 1 03 




s 






AA454642 


. 






AA454935 


nuclear respiratory factor 1 






AA456323 


ESTs 






AA457395 


ESTs 




10 


AA458850 








AA459662 


. 






MA40yDDd 


3-liydroxyisobutyryl-Coen2yme A hydrolase 






AA459679 


ESTs; Weakly similar to The KIAA01.91 qene is expressed 






AA459702 


ESTs 




15 


AA460017 


ESTs; Weakly similar to diaphanous-related formin rM.musculus] 






AA460324 


ESTs _ 






AA46 1 509 


ESTs; Weakly similar to putative p150 [H.sapiens] 






AA464414 


ESTs 






AA464428 


ESTs 




20 


AA470084 


ESTs 






AA476606 


ESTs 






AA478521 


ESTs 






AA478523 


ESTs; Moderately similar to !!!! ALU SUBFAMILY J WARNING 






AA479949 


RAB2; member RAS oncogene family 




25 


AA481252 


oncogene TC21 






AA485351 


KIAA1067 protein 






AA487264 


§§I2 . 






AA489072 


KIAA0870 protein 






AA489630 


KIAA0665 gene product 




30 


AA490225 


ESTs 






AA490227 


ESTs 






AA490255 


ESTs 






AA490890 


ESTs 






AA490916 


ESTs 




35 


AA490925 


epilepsy; progressive myoclonic epilepsy; type 2 qene; Lafora 






AA490955 


ESTs- Weakly similar to bullous pemphigoid antigen [M musculusl 






AA495812 


ESTs 






AA495824 


ESTs 






AA496369 


ESTs 




40 


AA504125 


ESTs 






AA521473 


SEC10 (S. cerevisiaeHike 1 
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Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 




AA598440 


EST 






AA598899 


Homo sapiens mRNA; cDNA DKFZp564D036 (from clone 






AA599244 


KIAA0530 protein 






AA599694 


KIAA0133 gene product 




5 


AA600037 


ESTs 






AA609135 


ESTs 






AA609582 


katanln p60 (ATPase-contalnlng) subunit A 1 






AA609684 


ESTs 






AA609839 


4-nitrQphenylphosphatase domain and non-neuronal SNAP25-like 






AA609862 


RNA-brnding protein gene with multiple splicing 






AA620423 








AA620747 


— 

ESTs 






AA62 1 364 


ESIs 






C20653 


5 — ; . 






D20085 


ESTs; Weakly similar to KIAA0742 protein [H.sapiens] 






D20749 


I§l2 






D51285 


I§l2 






D59972 


cullln 5 






F04112 


Mis 




20 


F13604 









H01662 


Ml2 






H05135 


i§l2 






H12245 
















H30894 


EST 






H43442 


leucyl-tRNA synthetase; mitochondrial 






H45996 


putative G protein-coupled receptor 






H6g281 


ESTs 






H69485 


ESTs 




30 


H69899 


ESTs; Moderately similar to unknown fH sapiens] 






H70627 


ESTs; Weakly similar to 1!!! ALU CLASS E WARNING ENTRY !!!! 






H73050 


Rhesus blood group; D antigen 






H73260 


ESTs 






H77531 


HIR (histone cell cycle regulation defective; S. cerevisiae) 




35 


H80552 








H80737 


lysyl oxidase 






H93412 


ESTs; Weal<iy simiiar to ORF YGRIOIw fS. cerevisiae! 






H94892 


v-rai simian leukemia viral oncogene homolog A (ras related) 






H95643 


neurotrophic tyrosine kinase; receptor; type 1 




40 


H96552 


ESTs 






H97146 


ESTs; Highly similar to G protein-coupled receptor kinase 6; 






H99131 


ESTs 





162 



wo 01/11086 



PCT/USOO/22061 
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Complete Title 


UniGenelD(1 1/29/99) 




H99462 


ribosomal protein; mitochondrial; L12 






nyyoo/ 


■^^^^ — : : 






N22140 


ESTs; Weakly similar to beta-tubulin [H.sapiens] 






N22197 


Sec23-rnleracting protein p125 






N23756 


solute carrier family 23 (nucleobase transporters); member 1 






N24134 


eukaryotic translatton initiation factor 1A; Y chromosome 






N24195 


novel centrosomal protein RanBPM 






N26739 


DKFZP564B147 protein 






N27098 


EST 




10 


N27637 


ESTs 






nioouyu 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 19 (Dbp5; yeast; 






N35967 


serine/threonine kinase 24 (Ste20; yeast homolog) 






N38959 


chaperonin containinc] TCP1 ; subunit 2 (beta) 












15 




ESTs 






N48270 


EST^ 

— -2 






N48365 


2 






N51316 


— -2 . 






N51499 


A kinase (PRKA) anchor protein 2 






N53976 


Ml2 






IN04 10/ 








N54300 


EST^ 








ESTs 






N59849 


ESTs 














N62375 


EST 






N63138 


ESTs 






N63172 


cell division cycle 42 (GTP-binding protein; 25kD) 






N63772 


novel putative protein similar to YIL091C yeast hypothetical 84 kD 
protein from SGA1-KTR7 






N 63787 


sema domain; immunoglobulin domain (Ig); short basic domain; 






N681 68 








N68201 


EST^ 






N68300 


ESTs 






N68321 






35 




EST 






N75007 


ESTs; Moderately similar to KIAA1004 protein [H.sapiens] 






N75542 


transcription factor 4 






N90066 


0-linked N-acetylglucosamine (GlcNAc) transferase 
(UDP-N-acetylglucosamine:polypeptide-N-acetvlqlucosaminyl 






N91246 


ESTs 




40 


N92751 


ESTs; Weakly similar to MICROTUBULE-ASSOCIATED 






N93214 


KIAA03 18 protein 
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Exemplar 
Accession 


Complete Title 


UniGeneIDd 1/29/99) 




N99148 


ESTs; Weakly similar to ZINC FINGER PROTEIN 83 [H.sapiens] 






R07876 


ESTs; Weakly similar to unknown [S.cerevlslae] 






R10865 


alpha-fetoprotein 






R11056 


ESTs 




5 


R11488 


ESTs 






R22947 


ESTs 






R23930 


ESTs; Highly similar to prediabetic NOD sera-reactive autoantigen 






R26589 


ESTs 






R37588 


RAB2; member RAS oncogene family-like 




10 


R37613 


Homo sapiens clone 25027 mRNA sequence 






R38398 


Homo sapiens clone 23758 mRNA sequence 






R39179 


ESTs 






R40923 


ESTs 






R41179 


Human mRNA for KIAA0328 gene; partial cds 




15 


R41294 


ESTs 






R42307 


early development regulator 2 (homolog of polyhomeotic 2) 






R43189 


ESTs 






R43306 


ESTs 






R44357 


ESTs; Weakly similar to cDNA EST EMBL:T01421 comes from 




20 


R44519 


EST; Moderately similar to Pro-Pol-dUTPase polyprotein 






R45088 








R47948 


ESTs 






R51524 


ESTs 






R54950 


ESTs 




25 


R55241 


ESTs 






R59585 


ESTs 






R60044 


ESTs; Highly similar to BETA-CATENIN fH.sapiens] 






R60872 


ESTs 






R66690 


ESTs 




30 


R67266 


exostoses (multiple )-like 1 






R73588 


ESTs 






R79403 


ESTs 






R87647 


ESTs 






R93622 


eukaryotic translation initiation factor 2; subunit 2 (beta; 38kD ) 




35 


R99599 


heterogeneous nuclear ribonucleoprotein U (scaffold attachment 






R99612 


ESTs; Moderately similar to HI! ALU SUBFAMILY J WARNING 






T02888 








T03170 


EST 






T10465 






40 


T15418 


EST 






T15597 


KIAA0661 gene product 






T15652 1 


ESTs 
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Accession 
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UniGeneIDd 1/29/99) 




T16898 


ash2 (absent; small; or homeotic; Drosophila; homolog)-like 






T26644 


ESTs; Weakly similar to zinc finger protein [H.sapiensl 






T40841 


ESTs 






T47566 






5 


T50116 








T50145 








T58615 


ESTs 






T59940 


ESTs 






T63595 


ESTs 




10 


T64891 








T64924 


ESTs 






T64933 


ESTs: Weakly similar to III! ALU SUBFAMILY SQ WARNING 
ENTRY i|H [H sapiens] 






T68875 








T69027 


ESTs 




15 


T69924 








T70353 


ESTs 






T797B0 


ESTs; Weakly similar to CGI-69 protein [IH.sapiens] 






T79951 


ESTs 






T80174 


ESTs; Moderately similar to similar to NEDD-4 fH.sapiens] 




20 


T80622 


ESTs; Weakly similar to envelope [H.sapiensl 






T85352 


ESTs 






T85373 


ESTs 






T86284 


ESTs 






T89579 


transcription factor Dp-1 




25 


T90360 


ESTs 






T94328 


ESTs 






T95590 








T97257 


ESTs 






r97599 


ESTs 




30 


T97620 


ESTs 






T97775 


EST 






T98152 


fibrillin 2(congenital contractural arachnodactyly) 






W31479 


ESTs 






W37999 


ESTs 




35 


W38240 








W40150 


chondroitin sulfate proteoglycan 6 (bamacan) 






W45435 


KIAA0784 protein 






W58202 


ESTs 






W58344 


ESTs 




40 


W58650 


ESTs 






\/V68736 


Human DNA sequence from clone 1189B24 on chromosome 
Xq25-26.3. Contains NADH-Ubiquinone Oxidoreductase IVILRQ 
subunit (EC 1.6.5.3; EC 1 .6.99.3; CI-MLRQ); Tubulin Beta and 
Proto-oncogene Tyrosine-protein Kinase FER (EC 2.7.1.112; 
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Complete Title 


UniGeneIDd 1/29/99) 




W691 06 


chromobox homolog 3 (Drosophlla HP1 qamma) 






W691 1 1 


Mis 






W69385 


nuclear mitotic apparatus protein 1 






W69399 


H1 histone family; member 0 




5 


W69459 


sex comb on midleg (Drosopiiila)-like 1 






W72424 


SI 00 calcium-binding protein A9 (calgranulin B) 






W72724 


ESTs 






W72834 


ESTs 






W73955 


Homo sapiens chromosome 19; cosmid R26445 




10 


W74701 


ESTs 






W76540 


DKFZP564G2022 protein 






W79397 


ESTs 






W85888 


ESTs; Moderately similar to III! ALU SUBFAMILY SQ WARNING 






W86038 


ESTs 




15 


W86881 








W87804 


ES? 






W88942 








W90022 


ESTs: Highly similar to LECT2 precursor [H.sapiensl 






W92272 


chromodomain hellcase DNA binding protein 3 




20 


W92764 


tumor necrosis factor; alpha-induced protein 6 






W93040 


Homo sapiens paired mesodenn homeo box 1 (PMX1 ); mRNA 






W93092 


neutral sphingomyelinase (N-SMase) activation associated factor 






W93227 


EST 






W93523 


Mil 




25 


W93659 


-§SIs 






W94003 


ESTs 






W94401 


Mis 






W94688 


perilipin 






W94787 


destrin (actin depolymerizing factor) 




30 


Z38294 


ESTs 






Z38311 


ESTs 






Z38465 


ESTs 






Z38525 


ESTs 






Z38538 


ESTs 




35 


Z38551 


ESTs 






Z38783 


Ca2+-dependent activator protein for secretion 






Z39113 


ESTs 






Z39255 


YDD19 protein 






Z39591 


EST 




40 


Z39783 


ESTs; Weakly similar to KOI H12.1 [C.elegans] 






Z39920 


ESTs; Weakly similar to NADH-CYTOCHROME B5 REDUCTASE 






Z40166 


ESTs 






Z40388 


ESTs 
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Z40646 


ESTs 




Z41697 


ESTs 




Z99349 


ESTs 




Z99394 


zinc finaer protein 36 fKOX 18) 





wo 01/11086 



TABLE 4 



PCT/USOO/22061 





Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




D86425 


Homo sapiens mRNA for nidogen-2 


Ms. 82733 


5 


D86983 


Human mRNA for KIAA0230 gene; partial cds 


Hs. 11 8893 




HG1098-HT1098 


Cystatin D 






HG1103-HT1103 


Guanine Nucleotlde-Binding Protein Ral, Ras-Oncogene Related 






HG3342-HT3519 


Id1 






J03764 


plasminogen activator inhibitor; type 1 


Hs.82085 


10 


L06797 


chemoklne (C-X-C motif); receptor 4 (fusin) 


Hs. 89414 




L15388 


Human G protein-coupled receptor kinase (GRK5) mRNA, complete cds 


Hs.211569 




L20971 


phosphodiesterase 4B; cAMP-specific (dunce (Drosophila)-homolog 
phosphodiesterase E4) 


Hs.188 




L35545 


endothelial cell protein C/activated protein C receptor 


Hs.82353 




L76380 


calcitonin receptor-like 


Hs.152175 


15 


M21305 


Human alpha satellite and satellite 3 junction DNA sequence 


Hs.247946 




M24736 


selectin E (endothelial adhesion molecule 1) 


Hs.89546 




M31166 


pentaxin-related gene; rapidly Induced by IL-1 beta 


Hs.2050 




M31551 


plasminogen activator inhibitor; type II (arginine-serpin) 


Hs, 75716 




M32334 


intercellular adhesion molecule 2 


Hs. 83733 


20 


M61916 


laminin; beta 1 


Hs. 82124 




M68874 


Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, complete 

cds 






M74719 


transcription factor 4 


Hs,75356 




M92934 


connective tissue grov/th factor 


Hs.75511 




M94856 


fatty acid binding protein 5 (psoriasis-associated) 


Hs.153179 


25 


U03057 


singed (Drosophila)-like (sea urchin fascin homolog like) 


Hs.11 8400 




U03877 


EGF-containing fibulin-like extracellular matrix protein 1 


Hs.76224 




U18300 


damage-specific DNA binding protein 2 (48kD) 


Hs.77602 




U27109 


Human prepromultimerin mRNA; complete cds 


Hs.32934 




U31384 


guanine nucleotide binding protein 1 1 


Hs.83381 


30 


U33053 


protein kinase C-like 1 


Hs.2499 




U59423 


IVIAD (mothers against decapentaplegic; Drosophila) homolog 1 


Hs.79067 




U70322 


karyopherin (importin) beta 2 


Hs.1 68075 




U81607 


kinase scaffold protein gravin 


Hs.788 




U83463 


syndecan binding protein (syntenin) 


Hs.81 80 


35 


□89942 


lysyl oxidase-like 2 


Hs.83354 




X04729 


Human mRNA for plasminogen activator inhibitor type 1 N-terminus 






X06256 


ntegrin; alpha 5 (fibronectin receptor; alpha polypeptide) 


Hs.1 49609 




X07820 


matrix metalloproteinase 10 (stromelysin 2) 


Hs.2258 




X54925 


matrix metalloproteinase 1 (interstitial collagenase) 


Hs.83169 


40 


X54936 


placental growth factor; vascular endothelial growth factor-related protein 


Hs.2894 




X60957 


tyrosine kinase with immunoglobulin and epidermal growth factor 

lomology domains 


Hs.78824 




X67235 


hematopoietically expressed homeobox 


Hs.1 18651 




X67951 


proliferation-associated gene A (natural killer-enhancinq factor A) | 


Hs.1 80909 
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X69910 


H. sapiens p63 mRNA for transmembrane protein 


Hs.74368 




X79981 


cadherin 5; VE-cadherin (vascular epithelium) 


Hs.76206 




Z18951 


caveolin 1 ; caveolae protein; 22kD 


Hs.247266 




AA187101 


2p61b6.r1 Stratagene endothelial cell 937223 Homo sapiens cDNA 
clone IMAGE:624659 5", mRNA sequence 




5 


N24990 


ESTs 


Hs.26418 




R81003 


Homo sapiens serine protease mRNA; complete cds 


Hs.1 54737 




AA025351 


ESTs 


Hs.1 34797 




AA02716B 


ESTs 


Hs.10031 




AA040465 


ESTs 


Hs.8728 


10 


AA045136 


ESTs 


Hs.22575 




AA054087 


phospholipase A2; group IVC (cytosolic; calcium-independent) 


Hs.1 8858 




AA071089 


ESTs; Moderately similar to Nil ALU SUBFAMILY SC WARNING ENTRY 

HI! [H. sapiens] 


Hs. 187932 




AA085918 


H.sapiens HUNKI mRNA 


Hs.247482 




AA1 87490 


ESTs 


Hs.21941 


15 


AA227926 


ESTs 


Hs.6682 




AA234743 


ESTs 


Hs.22120 




AA236559 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 


Hs.8768 




AA292694 


ESTs 


Hs.3807 




AA398243 


ESTs; Moderately similar to (defline not available 3694664) [H.sapiens] 


Hs.21806 


20 


AA406363 


ESTs 


Hs.30822 




AA4 11465 


ESTs 


Hs.8619 




AA412284 


poliovirus receptor 


Hs.1 71 844 




AA423987 


ESTs 


Hs.7567 




AA425309 


ESTs 


Hs. 33287 


25 


AA435896 


ESTs 


Hs.1 8397 




AA448238 


Homo sapiens mRNA for KIAA0915 protein; complete cds 


Hs.16714 




AA478778 


ESTs 


Hs.1 6450 




AA621714 


ESTs 


Hs.25338 




D51069 


Human isolate JuSo MUC18 glycoprotein mRNA (3' variant); complete 

cds 


Hs.211579 


30 


T34527 


UDP-N-acetyl-alpha-D-galactosamine:polypeptide 
N-acetylgalactosaminyltransferase 1 (GalNAc-TI ) 


Hs.80120 




U97519 


podocalyxin-like 


Hs.1 6426 




AA127221 


ESTs 


Hs.71059 




AA1 32983 


ESTs; Moderately similar to C-1-TETRAHYDROFOLATE SYNTHASE; 
CYTOPLASMIC [H.sapiens] 


Hs.44155 




AA 135606 


ESTs; Weakly similar to III! ALU SUBFAMILY SB WARNING ENTRY !!!! 
iH.sapiens] 


Hs.1 89384 


35 


AA156125 


ESTs 


Hs.72116 




AA1 79845 


RAB6 interacting; kinesin-like (rabkinesin6) 


Hs.73625 




AA232645 


ESTs 


Hs.42699 




F 10399 


ESTs 


Hs.14763 




HI 6772 


ESTs 


Hs.31444 


40 


N 39584 


ESTs 


Hs.1 7404 
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N52006 


UDP-N-acetyl-alpha-D-galactosamine:polypeptide 
N-acetylgalactosaminyltransferase 1 {GalNAc-T1) 


Hs. 80120 




N 53375 


Homer; neuronal Immediate early gene; 3 


Hs.166146 




N54067 


Homo sapiens mRNA for NIK; partial cds 


Hs.3628 




N64436 


ESTs 


Hs.20813 


5 


R26892 


ESTs 


Hs.221434 




T33637 


ESTs 


Hs.6841 




T57112 


yc20g1 1 .s1 Stratagene lung (#937210) Homo sapiens cDNA clone 
IMAGE:81284 3', mRNA sequence. 






W80763 


ESTs; Moderately similar to FK506-binding protein 65I<D [M.musculus] 


Hs.3849 




AA046808 


ESTs; Highly similar to 40S RIBOSOMAL PROTEIN S27 [H.sapiens] 


Hs. 108957 


10 


AA253217 








AA255991 


ESTs 


Hs. 175319 




AA258138 


ESTs 


Hs.88297 




AA426573 


ESTs 


Hs.41135 




AA443793 


ESTs 


Hs.94761 


15 


AA4g0588 


ESTs 


Hs.43118 




AA4g6257 


ESTs; Weal<ly similar to (defline not available 3513303) [H.sapiens] 


HS72165 




AA60971 7 


ESTs; Weal<ly similar to IVIICROTUBULE-ASSOCIATED PROTEIN 1B 
[H.sapiens] 


Hs. 66048 




D59570 




Hs. 17132 




F 13787 


ESTs 


Hs,58596 


20 


H88157 


ESTs 


Hs.41105 




H98988 


ESTs 


Hs.42612 




N34287 


unc5 (C.elegans homolog) C 


Hs.44553 




^52090 




Hs.47420 




N66845 


ESTs; Weakly similar to III! ALU CLASS B WARNING ENTRY !!!! 

[H.sapiens] 


Hs.1 65411 




M 68 905 








R32894 


ESTs 


Hs.45514 




R61715 


ESTs 


Hs.1 38237 




R71234 


yi54c08.s1 Scares placenta Nb2HP Heme sapiens cDNA clone 
MAGE'143054 3' similar to gb|M87908|HUMALNE32 Human carcinoma 
cell-derived Alu RNA transcript, (rRNA); gb:S41458 ROD 






R98105 


yrSOgl 1 .si Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone 
MAGE:206852 3', mRNA sequence. 




30 


T97186 


small inducible cytokine A5 (RANTES) 






\/V80814 


ESTs; Moderately similar to llll ALU SUBFAMILY SB WARNING ENTRY 

!!! [H.sapiens] 


Hs.1 93700 




AA404418 


EST 


Hs.144953 




AA405747 


ESTs; Moderately similar to HMG-box transcription factor [M.musculus] 


Hs.97865 




AA488687 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SQ WARNING ENTRY 
!!! [H.sapiens] 


Hs.190307 


35 


M.599143 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SQ WARNING ENTRY 

!!! [H.sapiens] 






^A608588 


ESTs 


Hs.1 93634 



170 



wo 01/11086 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




AA608751 


ESTs; Moderately similar to III! ALU SUBFAMILY SC WARNING ENTRY 
!!!! [H.saplens] 


Hs.244904 




C13961 


EST 


Hs.210115 




D60302 


ESTs 


Hs.1 08977 




H94892 


v-ral simian leukemia viral oncogene homolog A (ras related) 


Hs.6906 


5 


N93521 


transcription factor 4 


Hs.241362 




N95477 


ESTs 


Hs.1 02943 




R60044 


ESTs; Weakly similar to III! ALU SUBFAMILY J WARNING ENTRY llll 


Hs 106706 




R70506 


ESTs; Moderately similar to transformation-related protein [H. sapiens] 


Hs.107159 




T91518 


ye20f05.s1 Stratagene lung (#937210) Homo sapiens cDNA clone 
IMAGE;118305 3' similar to contains Alu repetitive element;contains 




10 


T95333 


ESTs; Weakly similar to Strabismus [D.melanogaster] 


Hs.122730 




R45630 


ESTs; Highly similar to KIAA0372 [H.sapiens] 


Hs.1 70098 




R20839 


yg05c07.r1 Scares infant brain 1NIB Homo sapiens cDNA clone 
IMAGE:31444 5', mRNA sequence. 






R23858 


ESTs; Moderately similar to envelope protein [H.sapiens] 


Hs.23986 




Al 024874 


ESTs; Weakly similar to (defline not available 3882257) [H.sapiens] 


Hs.57958 




W26247 


U5 snRNP-spedfic protein (220 kD); ortholog of S. cerevisiae Prp8p 


HS;6413 




AA856990 


§§l! 


Hs.125058 




AA 136653 


§§15 — . _ 






AA358869 


ESTs; Highly similar to SEC13-RELATED PROTEIN [H.sapiens] 


Hs.227949 




All 23976 


§§I2 


Hs.1 05689 




AI369384 


arylsulfatase D 






AA379500 


§§l! 


Hs.193155 




R49693 


§§l! . . . 


Hs.1 07708 




AA1 95678 


Homo sapiens mRNA for KIAA0465 protein; partial cds 


Hs.1 08258 




M30257 


vascular cell adhesion molecule 1 


Hs.1 09225 


25 


AA028131 


§§l! . 


Hs. 11 0342 




M1 0321 


Human von Willebrand factor mRNA, 3' end 


Hs.110802 




J03040 


secreted protein; acidic; cysteine-rich (osteonectin) 


Hs.1 11 779 




M86933 


amelogenin (Y chromosome) 


Hs.1 238 




AA012933 


tubulin-specific chaperone d 


Hs.241687 


30 


AA286710 


lymphocyte adaptor protein 


Hs.13131 




AA243278 


ribosomal protein; mitochondrial; L12 


Hs.1 09059 




D59711 


ESTs 


Hs.237289 




r94452 


ye36g7.s1 Stratagene lung (#93721) Homo sapiens cDNA clone 
IMAGE:1 19868 3', mRNA sequence 


Hs.241207 




AA053400 


ESTs 


Hs.241227 


35 


AA370302 


Homo sapiens mRNA; cDNA DKFZp586l1518 (from clone 

DKFZp586l1518) 






J05008 


endothelin 1 


Hs.2271 




U85193 


nuclear factor l/B 


Hs.33287 




AA256153 


ESTs 


Hs.23912 




X83107 


BMX non-receptor tyrosine kinase 


Hs.27372 


40 


AA046593 


ESTs 


Hs.28959 



171 



wo 01/11086 



PCTAJSOO/22061 





Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




AA4 10480 


ESTs 


Hs.30089 




D45304 


ESTs 


Hs.31595 




M90657 


transmembrane 4 superfamily member 1 


Hs.3337 




AA010163 


upstream regulatory element binding protein 1 


Hs.3383 


5 


AA1 36353 


ESTs 


Hs,38022 




Y07867 


plrin 


Hs.38842 




U84573 


procollagen-lysine; 2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2 


Hs.41270 




X60486 


H4 histone family; member G 


Hs. 46423 




AA 132969 


metalloprotease 1 (pitrilysin family) 


Hs.4812 


10 


AA114250 


KIAA0512 gene product 


Hs.48924 




F13782 


LIM binding domain 2 


Hs.4980 




AA283035 


ESTs; Weakly similar to l!!l ALU SUBFAMILY J WARNING ENTRY III! 
[H.saplens] 


Hs.54813 




AB002301 


Human mRNA for KIAA0303 gene; partial cds 


Hs.54g85 




AA056731 


Sjogren syndrome antigen A2 (60kD; ribonucleoprotein autoantigen 
SS-A/Ro) 


Hs.554 


15 


U6B019 


MAD (mothers against decapentaplegic; Drosophlla) homolog 3 


Hs.211578 




H99198 


ESTs: Moderately similar to THYMOSIN BETA-4 [H.saplens] 


Hs.56145 




AA598702 


bone morphogenetic protein 6 


Hs.6101 




N77151 


Homo sapiens mRNA for KIAA0799 protein; partial cds 


Hs.61638 




AA5051 33 


ESTs 


Hs.62273 


20 


AB000584 


prostate differentiation factor 


Hs.1 16577 




D 12763 


interleukin 1 receptor-like 1 


Hs.66 




AA2531 93 


ESTs 


Hs.6631 




AA432248 


ESTs 


Hs.6738 




AA083572 


v-ral simian leukemia viral oncogene homolog A (ras related) 


Hs.6906 


25 


AA479713 


ESTs 


Hs71962 




L40395 


Homo sapiens clone 23689 mRNA; complete cds 


Hs. 170001 




X52947 


gap junction protein; alpha 1; 43kD (connexin 43) 


Hs.7447i 




W80846 


vesicle-associated membrane protein 5 (myobrevin) 


Hs. 74669 




M34539 


FK506-binding protein 1A (12kD) 


Hs.752 


30 


D67029 


SEC14 (S. cerevisiae)-like 


Hs.75232 




U09587 


glycyl-tRNA synthetase 


Hs.75280 




M85289 


Human heparan sulfate proteoglycan (HSPG2) mRNA, complete cds 


Hs.211573 




D 10522 


myristoylated alanlne-rich protein kinase C substrate (MARCKS; 80K-L) 


Hs.75607 




W84712 


calumenin 


Hs.7753 


35 


D29992 


tissue factor pathway inhibitor 2 


Hs.78045 




L34657 


platelet/endothellal cell adhesion molecule (CD31 antigen) 


Hs.78146 




S78569 


lamlnin; alpha 4 


Hs.78672 




D43636 


Human mRNA for KIAA0096 gene; partial cds 


Hs.79025 




U97188 


IGF-II mRNA-binding protein 3 


Hs.79440 


40 


AA487558 


ESTs 


Hs.8135 




M28882 


Human MUC18 glycoprotein mRNA, complete cds 


Hs.211579 




X70683 


SRY (sex determining region Y)-box 4 


Hs.83484 




XI 4787 


thrombospondin 1 


Hs.87409 



172 



wo 01/11086 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




AA236324 


ESTs; Weakly similar to III! ALU CLASS A WARNING ENTRY !!!! 
[H. sapiens] 


Hs.92381 




C15324 


ESTs 


Hs.93668 




AA452000 


ESTs 


Hs.94030 




D83174 


collagen-binding protein 2 (colligen 2) 


Hs.9930 




D00596 


Homo sapiens gene for thymidylate synthase; exons 1 ; 2; 3; 4; 5; 6; 7; 
complete cds 


Hs. 196351 




D11428 


peripheral myelin protein 22 


Hs. 103724 




D13640 


major histocompatibility complex; class 1; C 


Hs. 18361 8 




D14874 


adrenomedullin 


HS;394 




226129 


ribonuclease; RNase A family; 1 (pancreatic) 


Hs.78224 


10 


D28476 


thyroid hormone receptor interactor 12 


Hs. 138617 




D86425 


Homo sapiens mRNA for nidogen-2 


Hs.82733 




D86983 


Human mRNA for KIAA0230 gene; partial cds 


Hs.1 18893 




D87953 


N-myc downstream regulated 


Hs. 75789 




HG1862-HT1897 


Calmodulin Type 1 




15 


HG2614-HT2710 


Collagen, Type Viii, Alpha 1 






HG2639-HT2735 


Single-Stranded Dna-Binding Protein Mssp-1 






HG2855-HT2995 


Heat Shock Protein, 70 Kda (Gb:Y00371 ) 






HG3044-HT3742 


Fibronectin, Alt. Splice 1 






HG3342-HT3519 






20 


HG3543-HT3739 


— 

Insulin-Like Growth Factor 2 






HG4069-HT4339 


Monocyte Chemotactic Protein 1 






HG417-HT417 


Cathepsin B 






J03764 


plasminogen activator inhibitor; type 1 


Hs.82085 




L06797 


chemokine (C-X-C motif); receptor 4 (fusin) 


Hs. 89414 


25 


L08246 


myeloid cell leukemia sequence 1 (BCL2-related) 


Hs. 86386 




LHLL! 


transketolase (Wernicke-Korsakoff syndrome) 


Hs. 89643 




L13977 


prolylcarboxypeptidase (angiotensinase C) 


Hs. 75693 







Human G protein-coupled receptor kinase (GRK5) mRNA, complete cds 






L19871 


activating transcription factor 3 


HS;460 


30 


L20859 


Human leukemia virus receptor 1 (GLVR1 ) mRNA: complete cds 


Hs. 78452 




L42176 


lour and a half LIM domains 2 


Hs.8302 




L49169 


Human G0S3 mRNA; complete cds 


Hs. 75678 




.76380 


calcitonin receptor-like 


Hs.1 521 75 




Vl 15990 


v-yes-1 Yamaguchi sarcoma viral oncogene homolog 1 


Hs. 194148 


35 


M23254 


calpain; large polypeptide L2 


Hs.76288 




M24736 


selectin E (endothelial adhesion molecule 1) 


Hs.89546 




M26576 


collagen; type IV; alpha 1 


Hs.119129 




M27396 


asparagine synthetase 


Hs.75692 




M31166 


pentaxin-related gene; rapidly induced by IL-1 beta 


Hs.2050 


40 


M31994 


Homo sapiens aldehyde dehydrogenase (ALDH1 ) gene, exon 13 and 

complete cds 






M32334 


ntercellular adhesion molecule 2 


Hs.83733 




M35878 


nsulin-like growth factor binding protein 3 


Hs.77326 



173 



wo 01/11086 
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Exemplar 
Accession 


Complete Title 


UniGenel 0(11/29/ 




M36429 


postmeiotic segregation increased 2-lil<e 12 


Hs.89672 




M57730 


ephrln-AI 


Hs.1624 




M57731 


GR02 oncogene 


Hs.75765 




M60858 


nucleolin 


Hs.79110 


5 


M62994 


filamin B; beta (actin-binding protein-278) 


Hs.81008 




M68874 


Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, complete 
cds 






M69043 


nuclear factor of kappa light polypeptide gene enhancer in B-cells 
inhibitor; alpha 


Hs.81328 




M74719 


transcription factor 4 


Hs.75356 




M75126 


hexoklnase 1 


Hs.11 8625 




M84349 


CD59 antigen p 18-20 (antigen identified by monoclonal antibodies 
16.3A5; EJ16; EJ30; EL32 and G344) 


Hs. 11 9663 




M92843 


zinc finger protein homologous to Zfp-36 in mouse 


Hs.198309 




M92934 


connective tissue growth factor 


Hs.75511 




M93056 


protease inhibitor 2 (anti-elastase); monocyte/neutrophil 


Hs.1 83583 




M94856 


fatty acid binding protein 5 (psoriasis-associated) 


Hs.153179 


15 


M95787 


transgelin 


Hs.75777 




S76965 


Protein kinase inhibitor [human; neuroblastoma cell line SH-SY-5Y; 
mRNA; 2147 nt] 


Hs. 75209 




S81914 


DIFFERENTIATION-DEPENDENT GENE 2 


Hs. 76095 




U03057 


singed (Drosophila)-like (sea urchin fascin homolog like) 


Hs.1 18400 




U03100 


catenin (cadherin-associated protein); alpha 1 (102kD) 


Hs. 178452 


20 


U03877 


EGF-containing fibulin-like extracellular matrix protein 1 


Hs.76224 




U08021 


nicotinamide N-methyltransferase 


Hs.76669 




U14391 


myosin IC 


Hs.82251 




U31384 


guanine nucleotide binding protein 1 1 


Hs.83381 




U32944 


dynein; cytoplasmic; light polypeptide 


Hs.5120 


25 


U40369 


Human spermidine/spermine Nl-acetyltransferase (SSAT) gene, 
complete cds 






U41767 


Human metargidin precursor mRNA, complete cds 






U48959 


Homo sapiens myosin light chain kinase (MLCK) mRNA; complete cds 


Hs.75950 




U51010 


Human nicotinamide N-methyltransferase gene, exon 1 and 5' flanking 
region 






U51478 


ATPase; Na+/K+ transporting; beta 3 polypeptide 


Hs.76941 


30 


U53445 


Human ovarian cancer downregulated myosin heavy chain homolog 
(Doci ) mRNA; complete cds 


Hs.1 5432 




U59289 


cadherin 13; H-cadherin (heart) 


Hs.63984 




U59423 


MAD (mothers against decapentaplegic; Drosophila) homolog 1 


Hs.79067 




U62015 


Homo sapiens Cyr61 mRNA, complete cds 






U63825 


Human hepatitis delta antigen Interacting protein A (dipA) mRNA; 
complete cds 


Hs. 66713 


35 


U67963 


Human lysophospholipase homolog (HU-K5) mRNA; complete cds 


Hs.6721 




U73379 


Human cyclin-selective ubiquitin carrier protein mRNA; complete cds 


Hs.93002 




U73824 


eukaryotic translation initiation factor 4 gamma; 2 


Hs.1 83684 




U77604 


microsomal glutathione S-transferase 2 


Hs.81874 




U81607 


kinase scaffold protein gravin 


Hs.788 



174 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




U89942 


lysyl oxidase-like 2 


Hs.83354 




X04412 


gelsolin (amyloidosis; Finnish type) 


Hs.80562 




X06985 


heme oxygenase (decycling) 1 


Hs.75967 




X07820 


matrix metalloproteinase 10 (stromelysin 2) 


t!i2258 


5 


X 12876 


l<eratin 18 


Hs.65114 




X15729 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 5 (RNA hellcase; 68kD) 


Hs.76053 




X52541 


early growth response 1 


Hs.738 




X53416 


filamin A; alpha (actin-binding protein-280) 


Hs.76279 




X54489 


GR01 oncogene (melanoma growth stimulating activity; alpha) 


Hs.789 


10 


X54925 


matrix metalloproteinase 1 (interstitial collagenase) 


Hs.83169 




X57206 


Inositol 1 ;4;5-trisphosphate 3-i<inase B 


Hs.78877 




X59798 


cyclin D1 (PRAD1: parathyroid adenomatosis 1) 


Hs.82932 




X60957 


tyrosine i<inase with Immunoglobulin and epidennal growth factor 
homology domains 


Hs .78824 




X65965 


H.sapiens SOD-2 gene for manganese superoxide dismutase 




15 


X6911J 


inhibitor of DNA binding 3; dominant negative heiix-loop-helix protein 


Hs .76884 




X70940 


eukaryotic translation elongation factor 1 alpha 2 


Hs.2642 




X87838 


catenin (cadherin-associated protein); beta 1 (88kD) 


Hs. 171271 




X91247 


thioredoxin reductase 1 


Hs. 13046 




X97748 


H.sapiens PTX3 gene promoter region 




20 


Y0081 5 


protein tyrosine phosphatase; receptor type; F 


Hs.75216 




AA30371 1 


ephrin-BI 


Hs. 144700 




L44538 


ESTs 


Hs. 156044 




AA025351 


ESTs 


Hs. 134797 




AA027050 


ESTs 


Hs.31189 


25 


AA029462 


ESTs 


Hs.17235 




AA045136 


ESTs 


Hs.22575 




AA047437 


ESTs 


Hs.22968 




AA054087 


phospholipase A2; group iVC (cytosoiic; calcium-independent) 


Hs. 18858 




AA071089 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SC WARNING ENTRY 

!!!! [H.sapiens] 


Hs. 187932 


30 


AA1 56450 


ESTs; Weal<ly similar to Similar to Rat trg gene product [C.elegans] 


Hs.8982 




AA1 87490 


ESTs 


Hs.21941 




AA1 95031 


ESTs; Moderately similar to PROBABLE G PROTEIN-COUPLED 
RECEPTOR APJ [H.sapiens] 


Hs.9305 




AA205724 


ESTs 


Hs. 10119 




AA227926 


ESTs 


Hs.6682 


35 


AA227986 


ESTs 


Hs.25329 




AA234743 


ESTs 


Hs.22120 




AA253216 


ESTs 


Hs.22283 




AA256210 


oncomodulin 


Hs.199134 




AA256268 


ESTs 


Hs.10283 


40 


AA279397 


ESTs; Moderately similar to fibronectin [H.sapiens] 


Hs.25001 




AA292379 


ESTs; Moderately similar to !lll ALU SUBFAMILY SQ WARNING ENTRY 
!!!! [H.sapiens] 


Hs.20340 
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wo 01/11086 
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Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




AA2927 1 7 


ESTs; Weakly similar to JM2 [H.sapiens] 


Hs.7891 




AA346551 





Hs.23457 




AA400292 




Hs.23786 




AA404338 


E^^ 


Hs.21812 


5 


AA4 12284 


poliovirus receptor 


Hs.171844 




AA423987 


ESTs 


Hs.7567 




AA428594 


ESTs 


Hs.21321 




AA430108 


ESTs 


Hs.6019 




AA431462 


ESTs 


Hs.28329 


10 


AA431470 


ESTs; Weakly similar to CAMP-DEPENDENT PROTEIN KINASE 
INHIBITOR; MUSCLE/BRAIN FORM [H.sapiens] 


Hs^3407 




AA443756 


ESTs; Moderately similar to (defline not available 4105275) [H.sapiens] 


Hs.6673 




AA449479 


ESTs; Highly similar to (defline not available 5106787) [H.sapiens] 


Hs.5216 




AA459916 


bradyklnin receptor B2 


Hs.25021 




AA465226 




Hs.28631 




AA478778 





Hs. 16450 




AA479037 


ESTs 


Hs.7961 




AA482597 


ESTs; Highly similar to (defline not available 4704739) [H.sapiens] 


Hs.26054 




AA487561 


ESTs; Highly similar to RAS-RELATED PROTEIN RAB-1A [H.sapiens] 


Hs.9813 




AA489245 


ESTs; Weakly similar to sperm specific protein [H.sapiens] 


Hs.5682 


20 


AA504110 


ESTs 


Hs.18063 




AA520989 


ESTs; Highly similar to SERINE/THREONINE PROTEIN 
PHOSPHATASE PP1-BETA CATALYTIC SUBUNIT [H.sapiens] 


Hs.9195 




AA599434 


ESTs 


Hs.25035 




AA608649 


Homo sapiens clone 23742 mRNA; partial cds 


Hs.6354 




AA609519 


ESTs 


Hs.26458 


25 


D51069 


Human isolate JuSo MUC18 glycoprotein mRNA (3' variant): complete 
cds 


Hs. 1857 18 




U97519 


podocalyxin-like 


Hs. 16426 




W28391 


proliferation-associated 2G4; 38kD 


Hs.5181 




AA035638 


Homo sapiens mRNA; cDNA DKFZp564F053 (from clone 
DKFZp564F053) 


Hs.71968 




AA083514 





Hs.68301 


30 


AA121315 


^§15 


Hs.70823 




AA147186 


^215 


Hs.92387 




MAI DDI 





"^^•^^^^^ 




AA 188932 





Hs.85640 




AA219653 





Hs.87125 


35 


AA232645 




Hs .42699 









Hs. 13233 




H48032 


ESTs 


Hs.9645 




H82117 


ESTs 


Hs.28043 




N39584 


ESTs 


Hs. 17404 


40 


N54067 


Homo sapiens mRNA for NIK; partial cds 


Hs.3628 




N59858 


ESTs 


Hs.33032 



176 
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UniGenelD(11/29/ 
99) 




N90933 


ESTs 


Hs.4867 




N93764 


ESTs; Moderately similar to !!!! ALU CLASS C WARNING ENTRY ill! 
[H .sapiens] 


Hs.10175 




R26124 


ESTs 


Hs. 24024 




R27957 


ESTs 


Hs.24230 


5 


R55470 


ESTs; Moderately similar to K02E10.2 [C.elegans] 


Hs. 11067 




T16550 


ESTs; Highly similar to vacuolar protein sorting homolog h-vps45 
[H.sapiens] 


Hs.6650 




T26674 


ESTs; Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 


Hs.6966 




T57112 


yc20g11.s1 Stratagene lung (#937210) Homo sapiens cDNA clone 
IMAGE:81284 3', mRNA sequence. 


Hs.8881 




T88700 





Hs. 173374 


10 


T90527 


ESTs 


Hs.7890 




W42789 


ESTs 


Hs.31446 




W60002 


plastin 3 (T isoform) 


Hs.41 14 




W78175 





Hs. 17901 




W84768 





Hs.141742 


15 


W94427 


ESTs; Weakly similar to Na;K-ATPase gamma subunit [H.sapiens] 


Hs.3807 




AA253217 


ESTs 


Hs.41 271 




AA426573 


ESTs 


Hs.41 135 




AA432374 





Hs.48029 




AA446622 


1212 


Hs.74313 




AA478771 


.^§1! 


Hs. 50841 




AA482594 




Hs. 62684 




AA490588 


ESTs 


Hs.431 18 




D59570 


ESTs 


Hs. 17132 




H88157 


ESTs 


Hs.41 105 


25 


H94648 


ESTs 


Hs.41995 




H97538 


ESTs 


Hs.42392 




H98670 


ESTs; Weakly similar to (defline not available 4884081 ) [H.sapiens] 


Hs.49753 




N22107 


ESTs; Moderately similar to 1!!! ALU SUBFAMILY SC WARNING ENTRY 
!!!! [H.sapiens] 


Hs. 172241 




W38197 


Accession not listed in Genbank 




30 


W80814 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SB WARNING ENTRY 
!!!! [H.sapiens] 


Hs. 196785 




AA287347 


ESTs 


Hs. 105088 




AA402799 


ESTs 


Hs. 182538 




AA404418 


EST 


Hs. 144953 




AA425107 


ESTs 


Hs.97016 


35 


AA425435 


CO 1 s, Moueraieiy similar lo mi f\LU oUDrAiviiLT j wakiniinu tiN i ky 
Nil [H.sapiens] 


Hs.98438 




AA442872 


ESTs 


Hs.1 10771 




AA452860 


ESTs; Moderately similar to !!!! ALU SUBFAMILY SP WARNING ENTRY 

!!! [H.sapiens] 


Hs. 19721 4 




AA488687 


ESTs; Moderately similar to III! ALU SUBFAMILY SQ WARNING ENTRY 
.!!! [H.sapiens] 


Hs.1 90307 




AA599674 


ESTs; Weakly similar to ORF [D.melanogaster] 


Hs. 1081 15 
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10 



15 



Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 


F13673 


ESTs 


Hs.99769 


H99093 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide (72kD) 


Hs.6179 


N22495 


yw35g1 l.sl Morton Fetal Cochlea Homo sapiens cDNA clone 
IMAGE:254276 3', mRNA sequence. 


Hs. 10241 5 


N23031 


myosin; heavy polypeptide 7; cardiac muscle; beta 


HS;929 


R15740 


carbohydrate (chondroitin 6/keratan) sulfotransferase 1 


Hs. 1 04576 


R39610 


calpain; large polypeptide L2 


Hs.76288 


W45560 


ESTs 


Hs. 102541 


Z39833 


H. sapiens mRNA for Rho6 protein 


Hs. 124940 


Z40583 


ESTs 


Hs.101259 


AA825437 


ESTs 




R66613 


Homo sapiens mRNA; cDNA DKFZp564F053 (from clone 
DKFZp564F053) 




AA868063 


carbohydrate (chondroitin 6/keratan) sulfotransferase 1 




AA1 28075 


zl16d08.r1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone 
IMAGE:502095 5', mRNA sequence. 




N66570 


ESTs 




Al 05 1390 


ESTs 




AA627122 


ESTs 




X02761 


fibronectin 1 


Hs. 118162 


AF010193 


MAD (motiiers against decapentaplegic; Drosopiiila) tnomolog 7 


Hs. 100602 


AA1 49044 


ESTs; Higtily similar to ttie KIAA0195 gene is expressed ubiquitously. 
[H. sapiens] 


Hs. 10086 


U82108 


solute carrier family 9 (sodium/hydrogen exchanger); isoform 3 
regulatory factor 2 


Hs.101813 


D78676 


ESTs; l\1oderately similar to (defline not available 4529890) [H.sapiens] 


Hs. 105509 


L35240 


enigma (LIIVI domain protein) 


Hs. 102948 


AA598737 


lactate dehydrogenase B 


Hs. 18041 4 


R69417 


ESTs 


Hs. 107055 


AA232837 


ESTs; Weakly similar to Human pre-mRNA cleavage factor 1 68 kDa 
subunit [H.sapiens] 


Hs. 107125 


N72695 


ESTs 


Hs. 108557 


M30257 


vascular cell adhesion molecule 1 


Hs. 109225 


M96843 


inhibitor of DNA binding 2; dominant negative helix-loop-helix protein 


Hs. 10961 7 


X68277 


dual specificity phosphatase 1 


Hs. 171 695 


AA292440 


myeloid differentiation primary response 


Hs.1 10571 


J03040 


secreted protein; acidic; cysteine-rich (osteonectin) 


Hs.111779 


AA228107 


ESTs 


Hs.54642 


AA449789 


connective tissue growth factor 


Hs.75511 


WO 1367 


ESTs 


Hs.1 70980 


AA610116 


ESTs; Highly similar to (defline not available 4325180) [H.sapiens] 


Hs.1 1663 


AA258308 


Homo sapiens mRNA; cDNA DKFZp564F053 (from clone 
DKFZp564F053) 


Hs.1 6561 8 


AA460273 


Homo sapiens mRNA for KIAA0517 protein; partial cds 


He. 12372 


AA286710 


lymphocyte adaptor protein 


Hs.13131 


T68873 


metallothionein 1 L 


Hs.1 43289 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




D63476 


PAK-tnteracting exchange factor beta 


Hs.172813 




M62403 


insulin-like growth factor-binding protein 4 


Hs.1516 




X55740 


5' nucleotidase (CD73) 


Hs.153952 




LI 0284 


calnexin 


Hs. 155560 


5 


AA243278 


ribosomal protein; mitochondrial; L12 


Hs. 109059 




AA430032 


pituitary tumor-transforming 1 


Hs.1 59626 




H16402 


ESTs 


Hs.17121 




D59711 


ESTs 


Hs.17132 




T94452 


ye36g7.s1 Stratagene lung (#93721) Homo sapiens cDNA clone 
IMAGE:119868 3', mRNA sequence 




10 


AA431571 


ESTs 


Hs.1 7894 




R79356 


Homo sapiens mRNA for KIAA0544 protein; partial cds 


Hs.1 9280 




AA280375 


ESTs 


Hs.19928 




Z49269 


small inducible cytokine subfamily A (Cys-Cys); member 14 


Hs.20144 




Z41740 


ESTs 


Hs.24462 


15 


AA121543 


Homo sapiens mRNA for KIAA0758 protein; partial cds 


Hs.22039 




J05008 


endothelin 1 


Hs.2271 




AA101878 


ESTs 


Hs.22793 




T35341 


ESTs; Highly similar to (defline not available 4519883) [H.sapiens] 


Hs.22880 




N87590 


ESTs 


Hs.23037 


20 


AA256153 


ESTs 


Hs.23912 




W74533 


Homo sapiens mRNA for KIAA0786 protein; partial cds 


Hs.24212 




□25997 


stanniocalcin 


Hs.25590 




V01512 


v-fos FBJ murine osteosarcoma viral oncogene homolog 


Hs.25647 




X56681 


jun D proto-oncogene 


Hs.2780 


25 


AA161292 


Interferon; alpha-inducible protein 27 


Hs.2867 




AA491465 


ESTs 


Hs.28792 




AA0465g3 


ESTs 


Hs.28959 




D50914 


Human mRNA for KIAA0124 gene; partial cds 


Hs.30736 




D45304 


ESTs 


Hs.31595 


30 


M90657 


transmembrane 4 superfamily member 1 


Hs.3337 




W69127 


ESTs; Weakly similar to zinc finger protein ZNF191 [H.sapiens] 


Hs.3449 




AA316186 


ESTs; Highly similar to (defline not available 42621 36) [H.sapiens] 


Hs.34549 




AA384503 


ESTs 


Hs.1 79260 




AA1 36353 


ESTs 


Hs.38022 


35 


AA044755 


ESTs; Weal<ly similar to III! ALU SUBFAMILY SX WARNING ENTRY .'!!.' 
[H.sapiens] 


Hs.1 73705 




U84573 


procollagen-lysine; 2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2 


Hs.41270 




AA058911 


ESTs; Weakly similar to membrane glycoprotein [M.musculus] 


Hs.4193 




AA620962 


dynein; cytoplasmic; light intermediate polypeptide 2 


Hs.44251 




AA285290 


small EDRK-rich factor 2 


Hs.44499 


40 


X60486 


H4 histone family; member G 


Hs.46423 




R31641 


ESTs 


Hs.197148 




AA489190 


ESTs 


Hs.48320 




F13782 


LIM binding domain 2 


Hs.4980 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 




AA257993 


Janus kinase 1 (a protein tyrosine l<inase) 


Hs.50651 




M24283 


intercellular adhesion molecule 1 (CD54); human rhinovirus receptor 


Hs. 168383 




AA443114 


ESTs; Weakly similar to Pll\/1-1 PROTO-ONCOGENE 
SERINE/THREONINE-PROTEIN KINASE [H.sapiens] 


Hs 5326 




T35289 


casein kinase 1 ; alpha 1 


Hs 195206 




N23817 


Homo sapiens clone 23675 mRNA sec|uence 






AA047151 


— : : : 


HsIsOT 




!i!ZZ15] 


Homo sapiens mRNA for KIAA0799 protein; partial cds 


ns.OTOoo 




AA480074 


— . 


nS.ozZUO 




Y00787 


interleukin 8 




10 


T99789 


^^^^ : 


Hs 6431 3 




W84341 


tissue inhibitor of metalloproteinase 2 






L09209 


amyloid beta (A4) precursor-like protein 2 


Hs 64797 




D12763 


interleukin 1 receptor-like 1 









§§]i 


ns.DoUf 


15 


AA253193 




zr^^ 




AA432248 


ES^ 


Hs.6738 




X82200 


stimulated trans-acting factor (50 kDa) 


Hs.68054 




AA083572 


v-ral simian leukemia viral oncogene homolog A (ras related) 


Hs.6906 




L00352 


low density lipoprotein receptor (familial hypercholesterolemia) 


Hs.181182 


20 


N75791 


ESTs 


Hs.7153 




X57579 


H.sapiens activin beta-A subunit (exon 2) 






X02612 


cytochrome P450; subfamily 1 (aromatic compound-inducible); 
polypeptide 1 


Ulc 79Q19 




H44631 


immediate early protein 






AA090257 


superoxide dismutase 2; mitochondrial 


Hs 177781 


25 


X83703 


H.sapiens mRNA for cytokine inducible nuclear protein 


Hs. 74019 




L40395 


Homo sapiens clone 23689 mRNA; complete cds 


Hs. 170001 




AA227913 




Hs. 198456 




X52947 


gap junction protein; alpha 1; 43kD (connexin 43) 


Hs.74471 




M11313 


alpha-2-macroglobulin 


Hs.74561 


30 


L14837 


tight junction protein 1 {zona occludens 1 ) 


Hs.74614 




M60721 


Human homeobox gene, complete cds 






D90209 


activating transcription factor 4 (tax-responsive enhancer element B67) 


Hs.181243 




T67986 


yc28e12.s1 Stratagene liver (#937224) Homo sapiens cDNA clone 
IMAGE:82030 3' similar to gb:X14723 CLUSTERIN PRECURSOR 


Hs.75106 




AA148318 


Human mRNA for KIAA0069 gene; partial cds 


Hs.75249 


35 


U97105 


dihydropyrimidinase-like 2 


Hs. 173381 




T25747 


H.sapiens OZF mRNA 


Hs.75471 




K02574 


Accession not listed in Genbank 






D78577 


tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation 

protein; eta polypeptide 


Hs.75544 




X53331 


matrix Gla protein 


Hs.75742 


40 


S73591 


upregulated by 1 ;25-dihydroxyvitamin D-3 


Hs. 179526 




X95735 


zyxin 


Hs.75873 
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Exemplar 
Accession 


Complete Title 


UniGenelD(11/29/ 
99) 


L16862 


G protein-coupled receptor kinase 6 


Hs.76297 


U44975 


Homo sapiens Kruppel-IIke zinc finger protein Zf9 mRNA; complete cds 


Hs.76526 


M97796 


Inhibitor of DNA binding 2; dominant negative helix-loop-helix protein 


Hs.180919 


U86782 


26S proteasome-associated pad1 homolog 


Hs. 178761 


AA099391 


ESTs 


Hs.77310 


M 19267 


tropomyosin 1 (alpha) 


Hs.77899 


D29992 


tissue factor pathway inhibitor 2 


Hs.78045 


L19314 


phosphorylase kinase; beta 


Hs. 19521 7 


S78569 


laminin; alpha 4 


Hs. 78672 


U28811 


Human cysteine-rich fibroblast growth factor receptor (CFR-1) mRNA, 
complete cds 




L77886 


protein tyrosine phosphatase; receptor type; K 


Hs.79005 


C14407 


neuronal tissue-enriched acidic protein 


Hs.79516 


M60278 


diphtheria toxin receptor (heparin-binding epidermal growth factor-like 
growth factor) 


Hs.799 


R81509 


splicing factor; arginine/serine-rich 1 1 


Hs. 184571 


AA487558 


ESTs 


Hs.8135 


D86962 


KIAA0207 gene product 


Hs.81875 


AA478971 


disabled (Drosophila) homolog 2 (mitogen-responsive phosphoprotein) 


Hs.81988 


D50683 


transforming growth factor; beta receptor II (70-80kD) 


Hs.82028 


U56637 


capping protein (actin filament) muscle Z-llne; alpha 1 


Hs.1 84270 


M61199 


Human cleavage signal 1 protein mRNA; complete cds 


Hs.82767 


M28882 


Human MUC18 glycoprotein mRNA, complete cds 




X15183 


CDW52 antigen (CAMPATH-1 antigen) 


Hs. 180532 


S53911 


CD34 


Hs.85289 


U20734 


Human transcription factor junB (junB) gene; 5' region and complete cds 


Hs.1 98951 


D28235 


prostaglandin-endoperoxide synthase 2 (prostaglandin G/H synthase 

and cyclooxygenase) 


Hs.9230g 


AA236324 


ESTs; Weakly similar to III! ALU CLASS A WARNING ENTRY HI! 
[H. sapiens] 


Hs. 92381 


AA1 48923 


Homo sapiens mRNA for DEPP (decidual protein induced by 
progesterone); complete cds 


Hs.93675 


AA174183 


ESTs 


Hs.93872 


AA456311 


ESTs; Weakly similar to III! ALU CLASS A WARNING ENTRY IN! 
;h. sapiens] 


Hs.93961 


L08069 


heat shock protein; DNAJ-like 2 


Hs.94 


AA452000 


ESTs 


Hs. 94030 


AA282140 


ESTs 


Hs.9587 


J02854 


myosin regulatory light chain 2; smooth muscle isoform 


Hs.9615 


AA442054 


DhosohdiDase C: aamma 1 ffonnerlv subtvoe 148) 


Hs.993 
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Accession # 


UniGenelD 








AA426573 




ESTs- Moderatel similar toendomuci Mm scul s 

ESTs, IVIoderateiy similar to endomucin [M.musculus] 






D58024 


Hs 57958 


ESTs; Weal<ly similar to KIAA0768 protsin [H.sapiGns] 




AAA8 — 


M31210 


Hs. 1542 10 


endotlielial differentiation; spiiinQolipid G-protsln-couplsd 
receptor; 1 


EDG1 




AAA7 


X06256 


Hs.1 49609 


integrin; alpha 5 (fibronectin receptor; alpha polypeptide) 


ITGA5 


AAB1 


L20859 


Hs.78452 


solute carrier family 20 (phosphate transporter); member 1 


SLC20A1 


AAB3 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromelysin 2) 


MMP10 


AAB4 


AA234743 


Hs.22120 


ESTs 




AAB5 


U97519 


MS. 104:^:0 


podocalyxin-like 




AAB6 — 


1 fn'3ft77 
UUOor I 


Hs 76224 


EGF-containing fibulin-like extracellular matrix protein 1 






M28882 


Hs 21 1 579 


melanoma adhesion molecule 


MCAM 




AAB9 — 




ns.ooloy 


matrix metalloproteinase 1 (interstitial collagenase) 


MMP1 


AAC1 


AA045136 










AA423987 


^^"^^^^^ 


ES^ 




AAC3 — 


AA234743 


Hs.22120 


ESTs 




AAC4 


AA1 56125 


Hs.72116 


ESTs; Moderately similar to hedgehog-interacting protein 
[M.musculus] 




AAC5 


AA025351 


Hs, 134797 


ESTs 




AAC6 


AA432248 


Hs.6738 


ESTs 




AAC7 


AA227926 


Hs.6682 


ESTs 




AAC8 


AA1 87490 


Hs,21941 


ESTs 




AAD1 


AA232645 


HS.42699 


ESTs 




AAD2 
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CLAIMS 

We claim: 

1 . A method of screening drug candidates comprising: 

a) providing a cell that expresses an expression profile gene which encodes a protein 
selected from the group consisting of a nucleic acid of Table 1, Table 2, Table 3, Table 4 
and Table 5 or a fragment thereof; 

b) adding a drug candidate to said cell; and 

c) determining the effect of said drug candidate on the expression of said expression 
profile gene. 

2. A method according to claim 1 wherein said determining comprises comparing the level of 
expression in the absence of said drug candidate to the level of expression In the presence of said 
drug candidate, wherein the concentration of said drug candidate can vary when present, and 
wherein said comparison can occur after addition or removal of the drug candidate. 

3. A method according to claim 1 wherein the expression of said profile gene is decreased as a 
result of the introduction of the drug candidate. 

4. A method of screening for a bioactive agent capable of binding to a angiogenesis modulator 
protein (AMP), wherein said AMP is encoded by a nucleic acid selected from the group consisting 
of a nucleic acid of Table 1 , Table 2, Table 3, Table 4 and Table 5, or a fragment thereof, said 
method comprising combining said AMP and a candidate bioactive agent, and determining the 
binding of said candidate agent to said AMP. 

5. A method for screening for a bioactive agent capable of modulating the activity of a 
angiogenesis modulator protein (AMP), wherein said AMP is encoded by a nucleic add selected 
from the group consisting of a nucleic acid of Table 1 , Table 2, Table 3, Table 4 and Table 5, or a 
fragment thereof, said method comprising: 

a) combining said AMP and a candidate bioactive agent; and 

b) detennining the effect of said candidate agent on the bioactivity of said AMP. 

6. A method of evaluating the effect of a candidate angiogenesis drug comprising: 

a) administering said dmg to a patient; 

b) removing a cell sample from said patient; and 

c) determining the expression profile of said cell. 

7. A method according to claim 6 further comprising comparing said expression profile to an 
expression profile of a healthy individual. 
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8. A method of diagnosing angiogenesis comprising: 

a) detemiining the expression of one or more genes selected from the group consisting of 
a nucleic acid of Table 1 , Table 2, Table 3, Table 4 and Table 5, or a fragment thereof in a first 
tyupe of a first individual; and 

b) comparing said expression of said gene(s) from a second normal tissue type from said 
first individual or a second unaffected individual, wlierein a difference in said expression indicates 
that the first individual has tissue that is undergoing angiogenesis. 

9. A biochip comprising a nucleic acid segment selected from the group consisting of the 
sequences set forth in Table 1, Table 2, Table 3, Table 4 and Table 5, wherein said biochip 
comprises fewer than 1000 nucleic acid probes. 

10. A biochip according to claim 9 comprising at least two nucleic acid segments. 

1 1 . A method for screening for a bioactive agent capable of interfering with the binding of an 
angiogenesis modulator protein (AMP) or a fragment thereof and an antibody which binds to said 
AMP or fragment thereof, said method comprising: 

a) combining anAMP or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said AMP or fragment thereof; and 

b) detennining the binding of said AMP or fragment thereof and said antibody. 

12. A method for inhibiting the activity of an angiogenesis modulator protein (AMP), wherein said 
AMP is encoded by a nucleic acid selected fnDm the group consisting of a nucleic acid of Table 1 , 
Table 2, Table 3, Table 4 and Table 5 or a fragment thereof, said method comprising binding an 
inhibitor to said AMP. 

13. A method according to claim 12 wherein said inhibitor is an antibody. 

14. A method of treating a disorder associated with angiogenesis comprising administering to a 
patient an inhibitor of n angiogenesis modulator protein (AMP), wherein said AMP is encoded by 
a nucleic acid selected from the group consisting of a nucleic acid of Table 1, Table 2, Table 3, 

Table 4 and Table 5 or a fragment thereof. 

1 5. A method according to claim 14 wherein said inhibitor is an antibody. 

1 6. A method of neutralizing the effect of an AMP, or a fragment thereof, comprising contacting 
an agent specific for said protein with said protein in an amount sufficient to effect neutralization. 
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17. A method for localizing a tiierapeutic moiety to angioggenic tissue comprising exposing said 
tissue to an antibody to an AMP or fragment ttiereof conjugated to said tiierapeutic moiety. 

18. The method of Claim 17, wherein said therapeutic moiety is a cytotoxic agent. 

19. The method of Claim 17, wherein said therapeutic moiety is a radioisotope. 

5 20. A method for inhibiting angiogenesis in a cell, wherein said method comprises administering 
to a cell a composition comprising antisense molecules to a nucleic acid of Table 1 , Table 2, 
Table 3, Table 4 or Table 5. 

21 . An antibody which specifically binds to a protein encoded by a nucleic acid of Table 1 , Table 
2, Table 3, Table 4 or Table 5 or a fragment thereof. 

1 0 22. The antibody of Claim 21 , wherein said antibody Is a monoclonal antibody. 

23. The antibody of Claim 21, wherein said antibody is a humanized antibody. 

24. The antibody of Claim 21, wherein said antibody is an antibody fragment. 

25. A nucleic acid having a sequence at least 95% homologous to a sequence of a nucleic acid of 
Table 1, Table 2, Table 3, Table 4 or Table 5 or its complement. 

15 26. A nucleic acid which hybridizes under high stringency to a nucleic acid of Table 1, Table 2, 
Table 3, Table 4 or Table 5 or its complement. 

27. A polypeptide encoded by the nucleic acid of Claim 25 or 26. 

28. A method of eliciting an immune response in an individual, said method comprising 
administering to said individual a composition comprising the polypeptide of Claim 27 or a 

2 0 fragment thereof. 

29. A method of eliciting an immune response in an individual, said method comprising 
administering to said individual a composition comprising a nucleic acid comprising a sequence of 
a nucleic acid of Table 1 , Table 2. Table 3, Table 4 or Table 5 or a fragment thereof. 

30. A method for detennining the prognosis of an individual with a disorder associated with 

25 angiogenesis comprising detennining the level of a AMP in a sample, wherein a high level of the 
AMP indicates a poor prognosis. 
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31 . A method of treating a disorder associated with angiogenisis comprising administering to an 
individual having a disorder associated with angiogenesis an antibody to a AMP or fragment 

thereof conjugated to a therapeutic moiety. 

32. The method of Claim 31 , wherein said therapeutic moiety is a cytotoxic agent. 

33. The method of Claim 31 , wherein said therapeutic moiety is a radioisotope. 
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FIGURE 1 
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FIGURE 2 



TGCCCTTCCCCGTGCTCCTTCTGGCCGCTCTGCCTCCGGTGcS^^ 

^^^^^ At!;^^!; ' i™^^ !™5^???^^'=CTTAGTTTTTG 



AACAAAGAAAATCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTCrTTTf-n,^'^:;;'^":* " 
GTAAATGCGGCAGTTACAAATTAACTGTTGGAGGm ^^^^^ ^ 



CTTTGACAATACATTCAGCACCATTTCTGAGAAGGTGATTTTCTTTGAATTAATCC^^T^TA^ 



FIGURE 3 
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FIGURE 4 



MGDKIWLPFPVL LLAALPPVLLPGAAGFTP.qT.nqnFTPT, Pnr:nKPnpvMOMOT -Try-T r"'" T 
UiDFHLASPtGKTLVFEORKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFELILDNMGEQAQEQEDWK 
KYITGTDILDMKLEDILESINSIKSRLSKSGHIQTLLRAFEARDRNIQESNFDRVN FWSMVNLVVMVW R 
^^QVyMt KSLFEDKRKSRT. 
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FIGURE 5 



Peptide Name: AAA4p1 



Sequence: H-Cys-Met-Leu-Lys-Ser-Leu-Phe-Glu-Asp-Lys 
-Arg-Lys-Ser-Arg-Thr-OH 



Peptide Name: AAA4p2 



Sequence: H-Cys-Ala-Gly-Phe-Thr-Pro-Ser-Leu-Asp-Ser-Asp 
-Phe-Thr-Phe-Thr-NHj 
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FIGURE 6 



I III! lllPllj 



I 



I I 



"lull 
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FIGURE 7 



TAAAAATCGAGCTGAGATGATAGATTTCAATATCCGGATCflAAAATGTGACAAGAAGTGATGCGGGGAAATATCGTTGTG 
RAGTTAGTGCCCCATCTGAGCAAGGCCAARACCTGGAAGAGGATACAGTCACTCTGGAAGTRTTAGTGGCTCCAGCAGTT 
CCATCATGTGAAGTACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGATGTCAAGACAAAGAAGGGAATCCAGC 
TCCTGAATACACATGGTTTAAGGATGGCATCCGTTTGCTAGAAAATCCCAGACTTGGCTCCCARAGCACCAACAGCTCAT 
ACACAATGAATACAAAAACTGGAACTCTGCAATTTAATACTGTTTCCAAACTGGACACTGGAGAATATTCCTGTGAAGCC 
CGCAATTCTGTTGGATATCGCAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCATCATAGCAGC 

AAACCTCCTTCCAGAAGAGTAATTCTTCATCTAAAGCCACGACAATGAGTGAAAATGATTTCAAGCACACAAAATCCTTT 
ATAATTTAAAGACTCCACTTTAGAGATACACCAAAGCCACCGTTGTTACACAAGTTATTAAACTATTATAAAACTCTGCT 
TTGTCCGACATTTGCAAAGAGGTACACGAGGAAATGGAATTGGTATTTCATTTTAATTTTCATGACTACTAACTCACCTG 
AACTTGCTATTTTAAACAAATAGTTCTGTCGACACCTAAAATATAATCTGGCTTCTTGTGTCTGGACTAAGTTAAAAGAA 
TTAAAATACTTTGTAATGTCAAAAA 



KNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQGQNLEEDTVTLEVLVAPAVPSCEVPSSALSGTVVELRCQDKEGNPA 
PF.YTWFKDGIRLLENPRLGSQSTNSSYTMNTKTGTLQFNTVSKLDTGEYSCEARNSVGYRRCPGKRMQVDDLNISG IIAA 
VVWALVrSVCGiii GVCYAQRKGYFSKETSFQKSNSSSKATTMSENDFKHTKSFII . 
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FIGURE 9 



Peptide Name: AAA1p1 



Sequence: H-Cys-AIa-Thr-Thr-WIet-Ser-GIu-Asn-Asp-Phe-Lys 
-His-Thr-Lys-Ser-NH, 



Peptide Name: AAA1p2 



Sequence: Ac-Arg-Cys-GIn-Asp-Lys-GIu-Gly-Asn-Pro-Ala-Pro 
-Gfu-Tyr-Thr-NH^ 
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FIGURE 10 



LJLuaJJLijJlji 



IB. a 



llilllll iff !i 



§E5EEf|-t|'ti-^5|-tl'|eS§5i|S2|l'lfl.iil 
I .... ..„|.«.<? -r.^. a" e^»- 
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StSSccSggggcccaccagcgtcccgctggtcaaggcccaccgcagctcggtctc 

?IiJr^STGMCGCTATATCACAATGCTGAAAATGAAACTCCACAACGGGAGCAATAA 

™S™SSaatcagcgcctgctgggtcatct 
SSSSSggL^ctgcatcagtgcgctgtccagctgctccaccgtgctgcc^^ 

n^S^IrAArCAcSTATCCTCTTCTGCACCACGGTCTTCACTCTGCTTCTGCTCTCCAT 

rScAAGScATTTCCAAGGCCAGCCGCAGCTCTGAGAATGTGGCGCTGCTCAAGACCGT 
W?^CCTCCTGAGCGTCTTCATCGCCTGCTGGGCACCGCTCTTCATCCTGCTCCTGCT 
??I^r^rGGCTGCAAGGTGAAGACCTGTGACATCCTCTTCAGAGCGGAGTACTTCCTGGT 

c^SgSg^Stc^ctccggcaccaaccccatcatttacac^ 

r^??cGGG?CTTCATCCGGATCATGTCCTGCTGCAAGTGCCCGAGCGGAGACTCTGCTGG 
rSTSSGAcJcATCATCGCCGGCATGGAATTCAGCCGCAGCAAATCGGACAATTC 
cJ^ScSccSScGAAGGGGACAACCCAGAGACCA^ 
cI^SSc55cc?i^AACTGGAAGCTGTCCACCCACCGG 

rTGGCCACCCCAGTGTTTGGAAAAAAATCTCTGGGCTTCGACTGCTGCCAGGGAGGAGCT 
GCTGCAAGCCAGAGGGAGGAAGGGGGAGAATACGAACAGCCTGGTGGTGTCGGGTGTTGG 
?GGGTAGAGTTAGTTCCTGTGAACAATGCACTGGGAAGGGTGGAGATCAGGTCCCGGCCT 
GGAATATATATTCTACCCCCCTGGAGCTTTGATTTTGCACTGAGCCAAAGGTCTAGCATT 

gt^Stcctaaagggttcatttggcccctcctcaaagactaatgtccccatgtgaaag 

CGTCTCTTTGTCTGGAGCTTTGAGGAGATGTTTTCCTTCACTTTAGTTTCAAACCCAAGT 

gagtgtgtgcacttctgcttctttagggatgccctgtacatcccacaccccaccctccct 

TCCCTTCATACCCCTCCTCAACGTTCTTTTACTTTATACTTTAACTACCTGAGAGTTATC 

agagctggggttgtggaatgatcgatcatctatagcaaataggctatgttgagtacgtag 

GCTGTGGGAAGATGAAGATGGTTTGGAGGTGTAAAACAATGTCCTTCGCTGAGGCCAAAG 

tttccatgtaagcgggatccgttttttggaatttggttgaagtcactttgatttctttaa 

AAAACATCTTTTCAATGAAATGTGTTACCATTTCATATCCATTGAAGCCGAAATCTGCAT 
^AAGCCCACTTTATCTAAATGATATTAGCCAGGATCCTTGGTGTCCTAGGAGAAACA 
GACAAGCAAAACAAAGTGAAAACCGAATGGATTAACTTTTGCAAACCAAGGGAGATTTCT 

SgStgagtctaacaaatatgacatccgtctttcccacttttgttgatgtttatttc 

AGAATCTTGTGTGATTCATTTCAAGCAACAACATGTTGTATTTTGTTGTGTTAAAAGTAC 

ttttcttgatttttgaatgtatttgtttcaggaagaagtcattttatggatttttctaac 

CCGTGTTAACTTTTCTAGAATCCACCCTCTTGTGCCCTTAAGCATTACTTTAACTGGTAG 

ggaacgccagaacttttaagtccagctattcattagatagtaattgaagatatgtataaa 

TATTACAAAGAATAAAAATATATTACTGTCTCTTTAGTATGGTTTTCAGTGCAATTAAAC 

cgagagatgtcttgtttttttaaaaagaatagtatttaataggtttctgacttttgtgga 

TCATTTTGCACATAGCTTTATCAACTTTTAAACATTAATAAACTGATTTTTTTAAAG 



FIGURE 11 
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FIGURE 12 



ATGGGGCCCACCAGCGTCCCGCTGGTCAAGGCCCACCGCAGCTCGGTCTCTGACTACGTCAACTATGATATCATCGTCCG 
GCATTACAACTACACGGGAAAGCTGAATATCAGCGCGGACAAGGAGAACAGCATTAAACTGACCTCGGTGGTGTTCATTC 
TCATCTGCTGCTTTATCATCCTGGAGAACATCTTTGTCTTGCTGACCATTTGGAAAACCAAGAAATTCCACCGACCCATG 
TACTATTTTATTGGCAATCTGGCCCTCTCAGACCTGTTGGCAGGAGTAGCCTACACAGCTAACCTGCTCTTGTCTGGGGC 
CACCACCTACAAGCTCACTCCCGCCCAGTGGTTTCTGCGGGAAGGGAGTATGTTTGTGGCCCTGTCAGCCTCCGTGTTCA 
GTCTCCTCGCCATCGCCATTGAGCGCTATATCACAATGCTGAAAATGAAACTCCACAACGGGAGCAATAACTTCCGCCTC 
TTCCTGCTAATCAGCGCCTGCTGGGTCATCTCCCTCATCCTGGGTGGCCTGCCTATCATGGGCTGGAACTGCATCAGTGC 



TGCTCTCCATCGTCATTCTGTACTGCAGAATCTACTCCTTGGTCAGGACTCGGAGCCGCCGCCTGACGTTCCGCAAGARC 
ATTTCCAAGGCCAGCCGCAGCTCTGAGAATGTGGCGCTGCTCAAGACCGTAATTATCGTCCTGAGCGTCTTCATCGCCTG 
CTGGGCACCGCTCTTCATCCTGCTCCTGCTGGATGTGGGCTGCAAGGTGAAGACCTGTGACATCCTCTTCAGAGCGGRGT 
ACTTCCTGGTGTTAGCTGTGCTCAACTCCGGCACCAACCCCATCATTTACACTCTGACCAACAAGGAGATGCGTCGGGCC 
TTCATCCGGATCATGTCCTGCTGCAAGTGCCCGAGCGGAGACTCTGCTGGCAAATTCAAGCGACCCATCATCGCCGGCAT 
GGAATTCAGCCGCAGCAAATCGGACAATTCCTCCCACCCCCAGAAAGACGAAGGGGACAACCCAGAGACCATTATGTCTT 
CTGGAAACGTCAACTCTTCTTCCTAG 
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FIGURE 13 



MGPTSVPLVKflHRSSVSDYVNYDIIVRHYNYTGKLNISADKENSIKLTS WFILICCFIILENIFVLLTIW KTKKFHRPM 
YYFIGNLALS DLLAGVAYTAHLLLSGATTY KLTPAQWFLRE GSMFVALSASVFSLLAIAI ERYITMLKMKLHNGSNNFRL 
rLLISACWVISLILGG LPIMGWNCISALSSCSTVLPLYHKH YILFCTTVFTLLLLSIVILYC RIYSLVRTRSRRLTFRKN 
ISKASRSSEN VALLKTVIIVLSVFIACWA PLFILLLLDVGCK VKTCDILFRAEYFLVLftVL NSGTNPIIYTl.TNKEMRRA 
FIRIMSCCKCPSGDSAGKFKRPIIAGMEFSRSKSDNSSHPQKDEGDNPETIMSSGNVNSSS. 
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FIGURE 14 



I Peptide names~l | amino acid sequence | I Solubility J 

AAA7p1 "] [ AC-KLNISADKENSIKLC-NH2 | 1mg/1mlH20 I 

AAA7p2 I H-CTTYKLTPAQWFLRE-NH2 jTiln.amt.DMS0/H2Cj 

AAA7p3 I H-CNPIIYTLTNKEMRR-NH2 | 1mg/1mlH20 | 

AAA7p1m I ~~Ac-KLNiGAEKDHGiKLC-NH2 | 1mg/1mlH20 I 
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FIGURE 15 
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FIGURE 17 



CTGGGGCCCCCGGCGCCGACCCCCGCTCGTGCCG^ 



GGGGGCl-TCAACTTAGACGCGGAGGCCCCAGCAGTAC 



STC^GCrGGCCGSxCC^^^^^ 

^o^^Sggggctcga^g^ttSga^^^ 

A^^r^^^CGCCGAT^C^R^^GCTCT^^ 
T^rccCAGCCC^ACATTATCAG^ 

gSam^cSg^cct^ctttccatgccca 

Sgtcaccgcccctccagaggctgagtactcaggactcgtcagacacccagg^^ 

rIcCTGTGACTACTTTGCCGTGAACCAGAGCCGCCTGCTGGl'GTGTGRCCTGGGCARCCCCATGAAGGCA 
CCACrclG^CTGTGGGGTGGCC^TC^ 

ttgaStcScScaagaatctcaaca^^^^ 
gcStcaggcccaggtcaccctg^^ 

TGGCATCrSACCAG^ 

^toc^ggccccmctccattagccagggtgtgctgg^^ 

gc^^ctcctatatgtgaccagagttacgggactcaactgcaccaccaatcaccccattaacccaaagg^^ 
cSttggavcccgagggttccctgcaccaccagcaaaaacgggaagctc^^ 

cctcgggacctcagatcctgaaatgcccggaggctgagtgtttcaggctgcgctgtgagctcgggcccct 

GCACCAACAAGAGAGCCAWiGTCTGCAGTTGCATTTCCGAGTCTGGGCCAAGACTTTCTTGCRGCGGGAG 
CACCAGCCArTTAGCCTGCAGTGTGAGGCTGTGTACAAAGCCCTGAAGATGCCCTACCGAATCCTGCCTC 



CGTCCCACTGTGGATCATCATCCTAGCCATCCTGTTTGGCCTCCTGCTCCTAGGTCTACTCATCTACATC 
CTCTACAAGCTTGGAl'TCTTCAAACGCTCCCTCCCATATGGCACCGCCATGGAAARAGCTCAGCTCAAGC 
CTCCAGCCACCTCTGATGCCTGAGTCCTCCCAATTTCAGACTCCCATTCCTGAAGAACCAGTCCCCCCAC 

cctcattctactgaaaaggaggggtctgggtacttcttgaaggtgctgacggccagggagaagctcctct 
cccctigcccagagacatacttgaagggccagagccaggggggtgaggagctggggatccctcccccccat 
gcactgtgaaggacccttgtttacacataccctcttcatggatgggggaactcagatccagggacagagg 

CCCAGCCTCCCTGAAGCCTTTGCATT'1'TGGAGAGTTTCCTGAAACAACTGGAAAGATAACTAGGAAATCC 
ATTCACAGTTCTTTGiSGCCAGACATGCCACAAGGACTTCCTGTCCAGCTCCAACCTGCAAAGATCTGTCC 
rCAGCCTTGCCAGAGATCCAAAAGAAGCCCCCAGTAAGAACCTGGAACTTGGGGAGTTAAGACCTGGCAG 
CTCTGGACAGCCCCACCCTGGTGGGCCAACAAAGAACACXAACTATGCATGGTGCCCCAGGACCAGCTCA 
GGACAGATGCCACAAGGATAGATGCTGGCCCAGGGCCAGAGCCCAGCTCCAAGGGGAATCAGAACTCAAA 
TCGGGCCAGATCCAGCCTGGCGTCTGGAGTTGATCTGGAACCCAGACTCAGACATTGGCACCAATCCAGG 

cagatccagractatatttgggcctgctccagacctgatcctggaggcccagttcaccctgatttaggag 
aagccaggaatttcccaggacctgaaggggccatgatggcaacagrtctggaacctcagcctggccagac 
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BSPLHAVQLRWCSPRRRPPLVPLLLLLVPPFPRVGGFNLDAERPAVLSGPPGSFFGFSVEFYRPGTDGVSVLVGA 

PKANTSOPGVLQGGAVYLCPWGftSPTQCTPIEFDSKGSRLLESSLSSSEGEEPVEyKSLQWrGATVRAUGSSILnCAPLy 

SWRTEKIiPLSDPVGTCVLSTDlJFTRILEYAPCRSDFSWAAGQGyCQGGFSAEFTKTGRVVLGGPGSYFWQGOILSATQEC! 

IAF,SyYPI!YLINLVQGQLQTRQASSiyDDSYLGYSVAVGErSGDDTEDFVAGVPKGNLTTOYVTrLNGSDIRSI.YNFSGE 

QMASyFGyAVAATDVNGDGLDDLLVGAPIJ^DRTPDGRPQEVGRVYVyLQHPAGIEPTPTLTWGHDErcRFGSSI.TFI.G 

DLDODGYNDVAIGAPFGGETQQGVVFVFPGGPGGLGSKPSQVLQPLWAASHTPDFFGSALRGGRDLDGNGYPDLrJGSFG 

VDKAWyRGRPlVSASASLTIFPAMFNPEERSCSLEGNPVACINLSrCLNASGKHVADSIGFTVF.LQLDWQKOKGGVRWv 

LFLASRQATLTQTLLIQNGAREDCREMKIYLRNESEFRDKLSPIHIALNFSLDPQAPVDSHGLRPALHYQSKSRIEDKAQ 

IU..DCGEDNICVPDLQLEVFGEOHKVyLGDKNALNI-TFHAQNVGEGGAYEAELRVTAPPEAEYSGLVRHPGNFSSL£CUY 

FAVNQ3RI.LVCDI.GNPMKAGASLWGGLRFTVPHLROTI<KTI0FDr01LSKNLNNSQSDVVSFRLSVEAQACVTLNGVSKP 

EAVLFPVSDWHPRDQPQKEEDLGPAVHHVYELINOGPSSISOGVLELSCFCALEGOOLLYVTRVTGI.NCrTNHPimKGL 

ELDPEGSLHHOQKRFAPSRSSASSGPQILKCPEAECFRLRCELGPLHQQESQSLC.LHFRVWAKTFLQREHQPrSLQCEAV 

YKAU<MPYRILPRQLPQKERQVATAVOHTKAEGSYGVPLH IIILAILFGLLLLGLLIYT T.V,.T.,:....v..,..y.TMlCmn 
1.KPPATSDA 
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AAA9 cDNA Sequence 



GGGCACCl^BiGARCTGCTTCAAGTGACCATTCTTTTTCTTCTGCCCAGTATTTGCAGCflGTAACflGCflCAB 
GTGTTTTAGAGGCAGCTflATflATTCACTTGTTGTTflCTflCAACAAAACCATCTATftACAACACCAaACACA 
GAATCATTACAGAAAAATGTTGTCACACCAACAACTGGAACAACTCCTAAAGGAACAATCACCAATGAATT 
ACTTAAAATGTCTCTGATGTCflACAGCTACTTTTTTAACAAGTARAGATGAAGGATTGARAGCCACAflCCA 
CTGATGTCAGGAAGAATGACTCCATCATTTCAAACGTAJ'.CAGTAACAAGTGTTACACTTCCCAATGCTGTT 
TCAACATTACAAAGTTCCAAACCCAAGACTGAAACTCAGAGTTCRATTAAAACAACAGAAATACCAGGTAG 
TGTTCTACAACCAGATGCATCACCTTCTAAAACTGGTflCATTAACCTCAATACCAGTTACAATTCCAGAAA 
ACACCTCACRGTCTCAAGTAATAGRCACTGAGGGTGGA^'.AAAATGCAAGCACTTCAGCAACCAGCCGGTCT 
TATTCCAGTATTATTTTGCCGGTGGTTATTGCTTTGATTGTAATAACACTTTCAGTATTTGTTCTGGTGGG 
TTTGTACCGAATGTGCTGGAAGGCAGATCCGGGCACACCAGAAAATGGAAATGATCAACCTCAGTCTGATA 
AAGAGAGCGTGAAGCTTCTTACCGTTAAGACAATTTCTCATGAGTCTGGTGAGCACTCTGCACAAGGAAAA 
ACCAAGAAd^aCAGCTTGAGGAATTCTCTCCACACCTAGGCAATAATTACGCTTAATCTTCAGCTTCTAT 
GCACCAAGCGTGGAAAAGGAGAAAGTCCTGCAGAATCAATCCCGACTTCCATACCTGCTGCTGGACTGTAC 
CAGACGTCTGTCCCAGTAAAGTGATGTCCAGCTGACATGCAATAATTTGATGGARTCAAAAAGAACCCCGG 
GGCTCTCCTGTTCTCTCRCATTTAAAAATICCATTACTCCATTTACRGGAGCGTTCCTAGSAAAAGGAATT 
TTAGGAGGAGAATTTGTGAGCAGTGAATCTGACAGCCCftGmGgTGGGCTCGCTGATAGGCATGACTTTCC 
TTAATGTTTAAAGTTTTCCGGGCCAAGAAITTTTATCCATGAAGACTTTCCTACTTTrCTCGGTGTTCTTA 
TATTACCTACTGTTAGTATTTATTGTTTACCACTATGTTAATGCAGGGAAAAGTTGCACGTGTATTATTAA 
ATATTAGGTAGAAATCATACCATGCTACTTTGTACATATAAGTATTTTATTCCTGCTTTCGTGTTACTTII 
AATAAATAACTACTGTACTCAATACTCTAAAAATACTATAACATGACTGTGAAAAIGGCAATGTTATTGTC 
TTCCTATAATTATGAATATTTTTGGATGGATTATTAGAATACATGAACTCACTAATGAflAGGCATTTGTAA 
TAAGTCAGAftAGGGACATAGGATTCACATATCAGACTGTTAGGGGGAGAGNTARTTATCAGTTCTTTGGTC 
TTTCTATTTGTCATTCATACTATGTGATGAAGATGTAAGTGCAAGGGCATTTATAACACTATACTGCRTTC 
ATTAGATAT 



FIGURE 21 



Protein 



MEIXQVTiLEXLPp^^ LWTTTKPS I TT PNTESLQKNWTPTTGTTPKGT ITNELLK 

MSLMSTATFLTSKDEGLKATTTDVRKNDSIISNVTVTSVTLPNAVSTLQSS KPKTETQSSIKTTEIPGSV L 
QPDASPSKTGTLTSIPVTIPENTSQSQVIXTEGGKNASTSATSRSYSSIILP| 
RMCWKADPGTPENGNDQPQSDKESVKLLTVKTISHESGEHSAQGKTKN 



FIGURE 22 
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AAB4 (MMPIO) 



CAGCAAAAGAGGAGGACTCCAACAAGGATCTTGCCCAGCAATACCTAGAAAAGTACTACAAC 

CTCGAAAAGGATGTGAAACAGTTTAGAAGAAAGGACAGTAATCTCATTGTTAAAAAAATCCA 

AGGAATGCAGAAGTTCCTTGGGTTGGAGGTGACAGGGAAGCTAGACACTGACACTCTGGAGG 

TGATGCGCAAGCCCAGGTGTGGAGTTCCTGACGTTGGTCACTTCAGCTCCTTTCCTGGCATGCC 

GAAGTGGAGGAAAACCCACCTTACATACAGGATTGTGAATTATACACCAGATTTGCCAAGAG 

ATGCTGTTGATTCTGCCATTGAGAAAGCTCTGAAAGTCTGGGAAGAGGTGACTCCACTCACAT 

TCTCCAGGCTGTATGAAGGAGAGGCTGATATAATGATCTCTTTCGCAGTTAAAGAACATGGAG 

ACTTTTACTCTTTTGATGGCCCAGGACACAGTTTGGCTCATGCCTACCCACCTGGACCTGGGCT 

TTATGGAGATATTCACnTTGATGATGATGAAAAATGGACAGAAGATGCATCAGGCACCAATTT 

ATTCCTCGTTGCTGCTCATGAACTTGGCCACTCCCTGGGGCTCTTTCACTCAGCCAACACTGAA 

GCITTGATGTACCCACTCTACAACrCATTCACAGAGCTCGCCCAGTTCCGCCTTTCGCAAGATG 

ATGTGAATGGCATTCAGTCTCTCTACGGACCTCCCCCTGCCTCTACTGAGGAACCCCTGGTGCC 

CACAAAATCTGTTCCTTCGGGATCTGAGATGCCAGCCAAGTGTGATCCTG(nTTGTCCTTCGAT 

GCCATCAGCACTCTGAGGGGAGAATATCTGTTCTTTAAAGACAGATATTTTTGGCGAAGATCC 

CACTGGAACCCTGAACCTGAATTTCATTTGATTTCTGCATTTTGGCCCTCTOTCCATCATAT^ 

GGATGCTGCATATGAAGTTAACAGCAGGGACACCGTTTTTATTTTTAAAGGAAATGAGTTCTG 

GGCCATCAGAGGAAATGAGGTACAAGCAGGTTATCCAAGAGGCATCCATACCCTGGGTTTTC 

CTCCAACCATAAGGAAAATTGATGCAGCTGTTTCTGACAAGGAAAAGAAGAAAACATACTTC 

TTTGCAGCGGACAAATACTGGAGATTTGATGAAAATAGCCAGTCCATGGAGCAAGGCTTCCCT 

AGA CTAAT AGCTGATGACTTTCCAGGAGTTGAGCCTAAGGTTGATGCTGTATTACAGGCATTT 

GGATTTTTCTACTTCTTCAGTGGATCATCACAGITTGAGTTTGACCCCAATGCCAGGATGGTGA 

CACACATATTAAAGAGTAACAGCTGGTTACATTGCTCTAGABg 
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